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Abstract 

General aspects of vielbein representation, ADM formulation and canonical quantization of 
gravity are reviewed using pure gravity in three dimensions as a toy model. The classical part 
focusses on the role of observers in general relativity which will later be identified with quantum 
observers. A precise definition of gauge symmetries and a classification of inequivalent solutions of 
Einstein's equations in dreibein formalism is given as well. In the quantum part the construction 
of the physical Hilbcrt space is carried out explicitly for a torus and cylinder type space manifold, 
which has not been done so far. Some conceptual problems of quantum gravity are discussed from 
the point of view of an observer sitting inside the universe. 
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Why three dimensional canonical gravity? 



When you started to learn quantum mechanics as a student, you have certainly studied some of the 
"toy models" appearing in every introductory lecture and every book on quantum physics. These 
standard examples not only provide very useful models to explain the "canonical construction" for 
the quantum state space of a system which is classically defined by an action principle. They also 
played an important role in the development of quantum mechanics itself. Nowadays we are looking 
for a quantum theory of gravity and maybe we can learn something from quantum mechanics. What 
makes the development of quantum gravity a bit different, is that up to now, and possibly also in 
the near future, there are no "key observations" waiting to be explained. Instead, it is merely the 
lack of a mathematically consistent theory of physics at the Planck scale that forces us to look for 
a quantum theory of gravity. It is Einstein's classical theory that is in contradiction with quantum 
physics at small length or high energy scales. Nevertheless there are a few standard toy models which 
are not so different from those considered in quantum mechanics, and studying these can help us to 
get some ideas of how quantum gravity might work and what its main principles are. 

The hydrogen atom of quantum gravity is certainly the radiating black hole. Explaining the 
Hawking radiation, which might in some sense be considered as a key observation, within the frame- 
work of a mathematically consistent model would probably be one of the main steps towards a 
quantum theory of gravity, just like the explanation of the stability and discrete energy states of 
the hydrogen atom was a milestone for quantum mechanics. What we have so far are various semi- 
classical or perturbative models describing Hawking radiation as quantum effects occuring at horizons 
in classical spacetimes, or lower dimensional toy models with very special matter fields that allow a 
consistent quantization and behave similar to black holes in four dimensions. Coming back to quan- 
tum mechanics, these models should be compared with the Bohr model of the hydrogen atom. It 
already gave an intuitive understanding of some properties of atoms, but it was not based on a fully 
consistent theory. However, this article is not about the hydrogen atom. It is about the harmonic 
oscillator. 

Like the harmonic oscillator in quantum mechanics, three dimensional pure gravity is a very nice 
model to study some of the main features of quantum gravity. It has all the typical features of 
gravity which are not present in other field theories like, e.g., quantum electrodynamics. On the 
other hand, all the technical difficulties normally appearing in quantum field theories are absent. For 
example, we do not need to renormalize the fields, all relevant operators will be well defined without 
regularization. In addition, complete solutions at both the classical and the quantum level can be 
given explicitly, so we can study the principal questions having explicit classical solutions and a well 
defined Hilbert space of quantum states (for a review on principal problems of quantum gravity see, 
e.g., |l], ^, ||]). In fact, some of these conceptual problems of quantum gravity can be solved for this 
toy model, and one can expect that similar solutions also exist in the four dimensional theory. 

Gravity is different from other field theories mainly because of its background structure. Typical 
field theories, including the standard model, are based on a flat Minkowski space, and the resulting 
global Poincare symmetry also plays a crucial role in any explicit calculation in these theories. It 
is especially the existence of particles and the perturbative approach based thereon that made the 
standard model so successful. There are several reasons why many attempts to quantize gravity in 
a similar way failed. First of all, there is no such background structure like Minkowski space in 
general relativity. There are no particles and no special field configuration which could serve as a 
starting point for perturbation theory. Moreover, even if one artificially introduces a background 
and describes gravity as a perturbation around a fixed metric, the resulting quantum theory turns 
out not to be renormalizable. Whereas this could be merely a technical difficulty, there is another 
and maybe deeper reason for the failure of these "covariant" quantization methods. When applied 
to a gauge theory, most of them require some kind of gauge fixing, but one always tries to keep the 
manifest Poincare symmetry. It has even become a criterion for a good quantum field theory to be 
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manifestly Poincare covariant. It is a priori not clear how this could work in general relativity, where 
the Poincare group is the gauge group and a global Poincare symmetry does not exist in general. 

So we should either look for something completely new, like string theory, or, and this is what we 
are going to do here, go back to the early days of quantum physics, forgetting everything we know 
about quantum field theory in Minkowski space, and try to construct a quantum theory of Einstein 
gravity. Thereby we will use the main ideas of quantum physics but not the sophisticated methods 
of modern quantum field theory. Our "old fashioned" way of doing quantum physics is known 
from non-relativistic quantum mechanics. Starting from a classical action principle, one derives the 
Hamilton Jacobi formulation, translates phase space coordinates into quantum operators, defines 
quantum states as wave functions on which these operators act, and finally physics is described in 
terms of expectation values of observables, defined with respect to a scalar product on the state 
space. This procedure is know as the "Dirac programme" or "canonical quantization." There 
have been some early attempts to use this method to quantize gravity, based on the Hamilton Jacobi 
or "ADM" formalism derived by Arnowitt, Deser and Misner || |(|. The main result was the famous 
"Wheeler DeWitt equation" Ji], ||, [9|, but for a long time these approaches had no success either, 
and they seemed to be technically even more difficult than the perturbative methods. 

With two important discoveries the situation changed drastically and the "canonical quantization" 
of gravity became a promising field of research again. First it was shown by Ashtekar [11, 12 1 that 
there is a new set of variables for Einstein gravity which simplifies the canonical treatment and hence 
the Dirac programme considerably. Closely related to this Witten []l3| showed that using the dreibein 
and spin connection instead of the metric as the basic fields variables, three dimensional gravity can 
be quantized by straightforward application of Dirac's method. After that, other methods were 
shown to produce similar results [14, |Tq] . 

The aim of this article is to provide a self-contained presentation of the Dirac programme applied 
to three dimensional gravity, emphasizing mostly physical aspects which are interesting for quantum 
gravity in general. All you have to know to read this article is how to quantize systems with gauge 
degrees of freedom in Dirac's formalism [Q, fill , and some basic knowledge about general relativity, 
vielbein formalism, and the ADM formulations of Einstein gravity will be useful, too ||. The new 
results presented here are mainly based on earlier work [17, 18], and also related to similar approaches 
in four dimensions It turns out that the combination of dreibein gravity, ADM formalism, and 
Dirac quantization is an ideal tool helping us to understand the physics of quantum gravity. Our 
approach will be a little bit unconventional, which hopefully makes this article interesting for both 
beginners looking for an introductory review and for experts, who might get some new insight into 
old questions considered from a different point of view. Our point of view will be that of an observer 
sitting inside the universe rather than looking at it from the outside, which is not restricted to three 
dimensions. One of the central questions will be whether we can make this observer a relativistic 
and a quantum observer at the same time. 

Moreover, instead of the "standard" example of a torus or higher genus manifold, we will consider 
non-compact space manifolds. This makes the mathematical treatment more complicated at first 
sight, because we have to introduce boundary conditions, but on the other hand it will be much 
easier to give a complete classification of solutions and, which is even more important, to extract 
the physical properties of these solutions. This is because the "boundaries" of spacetime will behave 
like "asymptotically flat ends," and associated with them there will be something like a "fixed 
stars background," which can be used to synchronize clocks, measure velocities using a generalized 
"redshift" and for many other "astronomical" measurements. So we really want to do physics in our 
three dimensional universe, and notions like "observer" and "measurement" already appearing on 
the classical level will help us to get some insight into the quantum theory. 

Nevertheless we will also focus on some mathematical subtleties, which so far have not been 
considered in detail. They turn out to be important when dealing with non-compact manifolds. The 
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first section will entirely be reserved to some helpful mathematical tools. Having exact mathematical 
definitions of, e.g., a covering manifold and the loop group, will enable us to give more precise 
theorems to classify solutions of the equations of motions and to construct the physical quantum 
state space. Sometimes it is also overlooked that the simply connected group associated with the 
Lorentz algebra in three dimensions is neither SO(l,2) nor SL(2), so we will also discuss this point 
in more detail before going over to physics. 

In section || we will present our "unconventional" approach to general relativity, which makes use 
of both the ADM formulation and the dreibein and connection field as the basic variables. The central 
object in this approach is the "local observer" sitting inside the universe and making experiments in 
his "local laboratory." Gravity will be considered as the theory that describes how these observers 
communicate with each other. As an additional feature of this interpretation of gravity, it will not 
break down if we allow exactly those kinds of singular metrics which are needed to quantize the 
theory based on Witten's idea. An explicit example will be given to show what happens at spacetime 
singularities. 

We will also introduce some non-local objects like parallel transport operators and geodesic dis- 
tances, which are closely related to the astronomical measurements mentioned above. These non-local 
objects are needed because three dimensional gravity is not really a field theory. Einstein's equa- 
tions in three dimensions imply that spacetime is locally flat (see, e.g., J2(J for an introduction to 
lower dimensional gravity and many references). Hence, there are no gravitational waves or forces. 
Spacetime locally looks like Minkowski space (unless we are in the neighbourhood of a singularity), 
and no local observables like curvature exist. Non-trivial solutions to the field equations only occur 
if the spacetime manifold is not simply connected. Then, e.g., transporting a vector once around a 
non-contractible loop will in general result in a rotated vector, and the "angle" of this rotation is one 
of the finitely many degrees of freedom of gravity in three dimensions. 

To identify all such parameters classifying a solution of Einstein's equations modulo gauge trans- 
formations will be the aim of section [||. Here we will need all the mathematical structures introduced 
in section |]. We will find that this classification is much simpler for space manifolds with "asymptotic 
ends" than for those without like the torus. As an example, we will choose an infinitely long cylinder 
as space and construct some "funny spacetimes," showing that there is a rich set of inequivalent 
solutions even for the simplest non-trivial manifold. This will also give us some understanding of the 
physical content of the gauge invariant parameters found to classify the solutions. 

In section || we are going to quantize the model using Dirac's method. As at the classical level, 
the reduction from a field theory to a state space with finitely many degrees of freedom will come out 
automatically. It will be even simpler to identify the quantum state space than to find the classical 
solutions. The result will be a physical state space consisting of "wave functions" depending on finitely 
many variables like in ordinary quantum mechanics, and a set of "observables" acting thereon as linear 
operators. We will also discuss some problems arising with products of non-commuting operators. 
As a consequence, for some of the observables the quantum operator will be defined up to ordering 
ambiguities only, or equivalently up to corrections of the order h. However, we will find that the 
ordering is unique if we want the observables to become hermitian. 

In the last section we will briefly discuss some conceptual problems of quantum gravity, and show 
how to solve them for our toy model. For example, we will describe what the physical content of the 
observables is, and how the "problem of time" can be solved by using special phase space functions 
that behave like clocks. For a cylinder manifold we will find a unique Hilbert space such that all 
observables become hermitian operators, and we will also see that there is a unique physical Hilbert 
space for the torus as well. Though the torus has always been the standard example, an explicit 
construction of the scalar product on the physical state space has not been carried out so far. We 
will find that up to rescaling there is only one reasonable product such that all standard observables 
become hermitian. 
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As we want to be mathematically as precise as possible, you will find some typically mathematical 
structures in this article. The main objects are introduced in "definitions," and a few important 
results are summarized in "theorems." This makes it somehow easier to refer to them later on, and it 
also helps to organize the article. In addition, there are lots of "exercises." They mainly split into two 
categories. The first type just summarizes some simple properties of new symbols or notions, which 
are either easy to prove or interesting but actually not required for our special purpose. Another kind 
of exercise typically consists of a "physical" question. Here it is sometimes better to ask a precise 
question than to describe things lengthly. Some exercises also suggest generalizations or extensions, 
which are mostly straightforward but beyond the scope of this article. 

1 Mathematics 

In the main part of this article we want to focus on the physical aspects of three dimensional gravity, 
so maybe it's best to introduce all the mathematical notions necessary for this purpose in a separate 
section. We then have all the structures at hand when we need them. There are mainly two mathe- 
matical objects playing a crucial role: the loop group (or first fundamental group) of some manifold 
and the three dimensional Poincare group, which is the simply connected Lie group associated with 
the Lie algebra iso(l,2). Introducing these two objects will give us all the relevant mathematical 
structure. 

In principle we should also introduce a number of fibre bundles to allow exact differential geo- 
metrical definitions for the various "fields". However, we will restrict our considerations to models 
based on manifolds with trivial tangent bundles. This class of models is already interesting enough 
for our purpose. All the bundles will then become trivial and sections therein may be treated as 
fields, coordinates are always global etc. The notation (and also the mathematical language) simpli- 
fies considerably. All the constructions made below can nevertheless be extended to "higher genus" 
manifolds with non-trivial tangent bundles, and for those familiar with fibre bundles, sections and 
connections on them, here is a good exercise: 

Exercise 1.1 (Extension to general manifolds) . For all the "fields" introduced in this article, give the 
correct definition in differential geometrical terms (some of them are section, others are connections 
or maps from one fibre into another) , and show that the general results are correct for manifolds with 
non-trivial tangent bundles as well. Sometimes one has to make other restrictions, e.g. the manifold 
has to be connected, orientable, etc. 

1.1 Covering manifold and loop group 

Given a manifold J\f, we introduce its covering manifold M ■ Physicists often define it by somehow 
"unwrapping" M, so that the result is a simply connected manifold which is locally isomorphic to 
J\f . This is rather illustrative, but there is a better "mathematical" definition, which turns out to be 
exactly the right thing for us here and, in addition, which will give us a second structure, the loop 
group, for free. 

Definition 1.1 (Covering Manifold). Let N be a connected manifold (in all our applications two 
dimensional, orientable). A path in N is a (smooth) map a : [0, 1] i— > M- Two paths are equivalent 
(or homotopic) if they can be (smoothly) deformed into each other without varying the end points, 
i.e. 

a~/3 3f : [0, l] 2 - AT, f(0,s) = a(s), f(t, 0) = a(0) = /?(0), 

f(l,8)=/3(s), /(*,!) = a(l) = /3(1). (1.1) 
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If two paths with the same end points are always homotopic, the manifold is called simply connected. 
Now fix a point x £ Af, then the covering manifold is the set of all homotopy classes of paths beginning 
at x: 

Af = {a\a(0) = (1.2) 

By "smooth" we always mean sufficiently often differentiable. The covering manifold is in fact a 
(discrete) bundle over Af and thus you can think of Af as somehow wrapped around Af such that 
when you go once around a non-contractible loop you are on a different sheet of Af. But the definition 
as homotopy classes of paths is more suitable as paths will naturally appear whenever we deal with 
Af. There are some natural maps on Af, the first is the projection back onto Af: 

Definition 1.2 (Projection). 

ir:Af^Af, vr(a)=a(l). (1.3) 

We use the same symbols a, (3, . . . for paths and for their homotopy classes, i.e. points in Af; this 
should not cause trouble. We only have to check that expressions are invariant under deformations 



whenever a denotes a special path. In (1.3) a denotes a homotopy class (an element of Af) on the 
left hand side, but a special path on the right hand side. However, a(l) is the same for all path 
belonging to a given homotopy class. 

An important feature of paths is that they may be linked together, simply by going along one 
first and then along the other: 

Definition 1.3 (Link). For two paths a : s i— > a(s) and (3 : s i— ► (3(s) such that a(l) = (3(0), the link 
a • (3 is given by 

, I a(2s) for s < i , /1 

a -P :S ^{p(2s-l) for* if. 

With the help of this operation one can show that the structure of the covering manifold does not 
depend on the choice of the arbitrary point x, if Af itself is connected, i.e. for any two points there 
must be a path connecting them. 

Exercise 1.2. The projection ir is locally one-to-one, and by using the same coordinates Af becomes 
a differentiable manifold. If another origin x' defines another covering manifold Af', then there is a 
one-to-one diffeomorphism / : Af — > Af' such that n' o f = n. 

The link is also well defined for homotopy classes of paths, and thus it also acts on Af. However, 
elements of Af can only be linked if the first element is a loop, i.e. a path beginning and ending at x. 
This subset of Af already provides the loop group of Af: 

Definition 1.4 (Loop Group). The set of loops T C Af consists of all (homotopy classes of) paths A 
ending at x, and the group multiplication is •. The unity r] and the inverse A -1 are given by (the 
homotopy classes of) the constant loop and the reversed loop, respectively, 

r](s)=x, A _1 (s) = A(l - s). (1.5) 

The last equation can also be used to define an inverse of any path a, or of any homotopy class of 
paths, but for a £ Af the inverse a -1 is in general no longer an element of Af. As a set we have 
T = ir~ l (x), i.e. T consists of all points of Af lying "over x" . Our definition of the loop group does 
not require any additional structure: it is just a subset of Af and the multiplication is (|0|). 

We can not only link a loop with another loop but also with another path starting at x. This 
gives a second natural map on Af: 
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Definition 1.5 (Shift). For every loop A G T there is a diffeomorphism 

b x :M^M, a^X*a. (1.6) 

If the manifold is not too complicated, i.e. it does not have infinitely many "holes" or something 
like that, than the loop group has a simple structure. In all our examples it will be a discrete group 
generated by a finite number of elements. 

Definition 1.6 (Primitive Loops). A finite set To C T is called a set of primitive loops, if every A G T 
can be written as a finite product of elements of To and their inverses. The loop group itself is then 
given by the free group generated by To, i.e. the set of all possible products of elements, modulo 
some relations, i.e. there may be products of primitive loops which are contractible. 

A well know example is the torus, where we have two primitive loops To = {A, k}, and one relation 
A • k • A -1 • = rj. An example for a manifold with boundaries will be given below. 

There are some other useful notions we introduce here without motivation. They will become 
important later on, but here is a better place to give their definitions. The first is rather simple. 

Definition 1.7 (Periodic Function). A function / on AT is called periodic if / o b\ = f for all A G T. 

Obviously, the projection ir is periodic, and every periodic function can be written as / o tt with some 
function / on J\f . To simplify the notation, we always use the same symbol for a periodic function on 
A/" and the corresponding function on M. More interesting than periodic functions are quasiperiodic 
functions. 

Definition 1.8 ( Quasiperiodic Function). Let G be a group. A map g : J\f — > G is called quasiperiodic 
if there are elements g\ G G such that g o b\ = g\g for all A G T. 

Hence, the shifted function g o b\ differs from g itself by multiplication with a constant from the left 
only. It follows immediately that A \—* g\ is a group homomorphism T — ► G. 

Definition 1.9 (Normalized Quasiperiodic Function). A quasiperiodic function is called normalized if 
9x = 3(A), or equivalently if g(rj) = 1, the unity element of G. 

Such functions will appear naturally as parameters for the solutions of the equations of motion, and 
we will also need them as auxiliary fields necessary to deal with boundaries. 

The last construction in this section also has to do with boundaries. Assume that M is a compact 
2-manifold with some boundaries. Each boundary labeled by an index I is a circle (it is a 1-manifold, 
but it cannot be a real line because M is compact). How does the boundary of M look like? To 
see this, we introduce a periodic coordinate <ft on the boundary of N (with period 2ir). Then we 
can construct a family of paths G M, all starting at x and ending at the boundary point with 
coordinate eft. Once we fixed the path ae = ct^o, an other paths are unique up to smooth deformations. 
Figure [l] shows how the family looks like. Starting from a.£ the paths are obtained by moving the 
end point along the boundary. All these paths represent points on the boundary of A/". The family of 
paths is not unique: there are in general several boundaries of M projected onto the same boundary 
of N , as choosing another ct£, say A» a? for some A G T, we end up with a different family of paths in 
figure |], corresponding to a different boundary of M. However, there is a special element of the loop 
group for which we do not get a different family but only a reparameterization of the same family. 
Calling this A^, we have A^ • = ae t< j,+2w 

If we now assume that this Xi is not contractible, and that this is also true for any power of it, i.e. 
(\e) z 7^ rj for z / 0, then two paths from the same family are never homotopic. Hence, the boundary 
of N is a real line parameterized by ^ G M. For each A G T, which is not a power of A^, we get a 
different line represented by A • on a,, (f) G R. In general there are infinitely many such lines, so the 
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Figure 1: The family of paths $ associated with a boundary of N (the bold circle). The family is 
fixed by choosing the homotopy class of at£. There is also a loop \e £ T associated with the boundary, 
which is given by X e = a^+2 n • a^f 1 for any <j). 




boundary of N is rather complicated. However, for our purposes it is sufficient to know that for each 
boundary of M we can construct a special family of paths a^, which is not unique, but once fixed 
there is a unique loop \i that goes exactly once around that boundary. 

As an example, consider a sphere with N > 2 holes cut out, so that we have iV boundaries like 
in figure |l|. We can arrange the paths ct£ such that for i = 1, . . . , N they leave x "clockwise" and do 
not intersect. Then we can take the \( to be the primitive loops and the loop group of N is given 



as described in definition 1.6 with the relation Ai • ■ • • • Xn = f], as linking all the loops together 
we get a single loop going once around all the holes, which can be contracted. For N = 2 the two 
families form the complete boundary of M (M is a cylinder and M is a infinitely long strip). But J\f 
has infinitely many boundaries for N > 3. 

Exercise 1.3. Of course, we can drop the loop \n from To- Then the loop group becomes the free 
group generated by To = {Ai, . . . , Aat_i}. More general: show that the loop group of a manifold 
consisting of a genus g surface, with iV holes like in figure || cut out, is a free group generated by 
2g + iV — 1 primitive loops. 

Finally, here are some useful features of paths and loops, some of which have already been used: 
Exercise 1.4- Show that 

1. J\f is simply connected, or equivalently J\f = J\f; 

2. definition |1.3| can be changed such that a • /3 is smooth at s = 1/2; 

3. the link is well defined on homotopy classes of paths; 

4. T is a group; 

5. the shift maps provide a representation of T: b„ = id, b\ o b K = b\ 9Ki . 
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1.2 Symmetry algebras and groups 

Three dimensional gravity can be formulated as a gauge theory, the gauge group being the Poincare 
group. The basic fields can be combined into a single connection and the equations of motion then 
require the field strength of this connection to vanish. Thus, to understand three dimensional gravity 
it is essential to know how the Poincare algebra and especially the corresponding group looks like. 
The basic observables, i.e. the gauge invariant parameters of the solutions of the equations of motion, 
will turn out to be elements of this group. 

Definition 1.10 (Poincare Algebra). The Poincare algebra in three dimensions is six dimensional, 
and a suitable basis is given by the three Lorentz rotations L a together with three translations H a , 
a = 0, 1,2 labeling the coordinates of three dimensional Minkowski space M 3 . Their commutation 
relations read 

[L a , L b ] = -e ab c L c , [L a , H b ] = -e ab c H c , [H a , H b ] = 0, (1.7) 

where the structure constants are given by the Levi Civita symbol obeying e 012 = —£012 = +1, and 
indices are raised and lowered using the Minkowski metric rf a i, = diag(— 1, 1,1). Small Latin indices 
a,b,c,... are also called flat indices, referring to the flat metric 77^. 

Let us first have a look at the Lorentz subalgebra spanned by L a . The identity 2rj ab = e ac d£ bd c tells 
us that r] a {) can be chosen to be the metric on the Lorentz algebra, so we can identify it with three 
dimensional Minkowski space M 3 (as a vector space). We need two matrix representations of this 
algebra: 

Definition 1.11 (Representations of the Lorentz Algebra). The vector (or spin 1) representation of 
the Lorentz algebra is so (1, 2) and given by 

L a ^e a \, [e a \,e b \} = -e ab c e c \, (1.8) 

where the dots denote flat indices to be treated as matrix indices. The spinor (or spin 1/2) repre- 
sentation is si (2) and the basis is given by 

L a ^b a , [h a ,b b ] = -z ab cb c i (i-9) 

where the gamma matrices form a Clifford algebra 

lalb = r)abl- £abc 7°, ( L10 ) 

and they are explicitly given by 

*=(-°i J)- «)• Hi -U (U1 > 

The representations act on vectors v a £ M 3 and (Majorana) spinors x £ § 2 > where S 2 is some real 
two dimensional vector space. We will avoid products of spinors so it does not matter whether spinors 
are commuting or anticommuting, and no additional structure is needed on § 2 . To keep the notation 
consistent, we don't write out spinor indices explicitly. Vector indices are sometimes "abbreviated" 
by a dot to indicate that they are used like matrix indices. We also use the gamma matrices to map 
M 3 onto $1(2) and vice versa: 

v = v a ~f a , v a = iTr(^7 a ), v a w a = \Tv{vw), (1.12) 

showing explicitly that M 3 ~ s[(2). We use this isomorphism frequently and also speak of a "vector" 
v which is actually a matrix. We call it spacelike, timelike or lightlike, or orthogonal to some other 
matrix, depending on the corresponding property of v a . 
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Figure 2: The group SL(2) consists of two copies of the part of M 3 between the hyperboloids of 
proper length squared —1. The upper hyperboloid of the right part is glued to the lower one on the 
left and vice versa. The three dimensional manifold is obtained by rotating the picture around the 
vertical axis. The covering group SL(2) consists of a series of such parts, each one's upper boundary 
glued to the next's lower boundary. 




Now it is in principle straightforward to define the Lorentz group as the unique simply connected 
Lie group associated with the Lorentz algebra. Unfortunately this is neither S0(l,2) nor SL(2), 
which would be similar to SU(2) as the simply connected Lie group associated with so (3) ~ su(2). 
In fact, SL(2) is a two-fold covering of SO(l, 2), but it is not yet simply connected. To see this, let's 
have a closer look at SL(2). 

It consists of all 2 x 2 matrices with determinant one. The gamma matrices together with the 
unity matrix 1 provide a basis of all 2 x 2 matrices, so we can write an element of SL(2) as 

g = vl + v a j a , iel, v a G M 3 . (1.13) 

Calculating the determinant yields the restriction 



v 



2 - v a Va = l v = ±vV + 1, (1.14) 



where v 2 = v a v a . Hence, we must have v 2 > —1: the vector v a G M 3 must lie in the region between 
the two timelike hyperboloids of length squared —1. For each vector there are two elements of SL(2), 
corresponding to the sign i n (|1.14j ), except for v 2 = —1, where the two signs actually yield the same 
group element. Changing ( 1.13; ) slightly we introduce a useful set of coordinates on the group: 



g{v\±) = ±(vl + v a la ), v 2 >-l, v = +vV + 1, 

=> 9(v a ,+)=g(-v a ,-) for v 2 = -l. (1.15) 

A picture of this coordinate system is drawn in figure |2| We call the matrices g(v a ,±) spacelike, 
lightlike, timelike, or null depending on the length squared of the vector v a . The null matrices are 
obviously 1 and — 1, and for all others we have |^ Tr(gr) | greater, equal, or less than 1 for spacelike, 
lightlike, or timelike matrices, respectively. 

Two show that SL(2) is a two-fold covering of S0(1, 2), we have to look for a group homomorphism 
that maps g onto a matrix g a b satisfying g a bQ c d T lac = ^Ibd- The solution is given here: 
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Exercise 1.5. 



SL(2) -» S0(1, 2)+ : g ^ <A = \ Tr^V^) (1.16) 
is a two-to-one group homomorphism mapping g and —g onto the same g a b. 

If an SL(2) matrix is denoted by a bold letter, we denote the projected S0(l,2) matrix by adding 
indices to the "un-bolded" symbol. We also drop the + index from S0(l,2), which always means the 
set of proper Lorentz transformations with <?°o > 0. The elements g and —g are located at the same 
point in figure || but in different parts, so SO(l, 2) just consists of one such part only, the upper and 
lower hyperboloid identified. 

In figure || we see that SL(2) is not simply connected, i.e. it is not the unique simply connected 
Lie group associated with sl(2) ~ so(l, 2). There is a non-contractible loop going along the central 
axis from —70 to 70 m ° ne part of the picture and then back on the other side. This is the SO(2) 
subgroup of SL(2). With the tools introduced in the previous section, we know how to obtain the 
simply connected group we are looking for. It is the covering manifold of SL(2). It is obvious how 
this looks like if we draw a picture like figure |2[ We get a whole series of parts instead of just two 
and they are glued together to a chain where each upper hyperboloid is glued to the lower boundary 
of the next part. 

Definition 1.12 (Lorentz Group). The Lorentz group SL(2) is the covering manifold of SL(2) (or 
SO(l,2)) with basepoint x = 1, i.e. its elements are homotopy classes of paths in SL(2), denoted by 
g : s 1— > g(s), with g(0) = 1. The product of two paths is defined "pointwise": 

g : s h-> g(s), h : s 1— ► h(s) => gh : s 1— > g(s)h(s). (1-17) 

The natural projection SL(2) — > SL(2) is denoted by dropping the tilde from the symbol. Note that 
in addition to the group multiplication there is also the link defined on SL(2), and there is in fact a 
simply relation between them. 

Exercise 1.6. Show that the group multiplication coincides with the link •, i.e. gh = g • h, if g is a 
loop. The loop group of SL(2) is a discrete subgroup of SL(2). 

Unfortunately there is no faithful matrix representation for the covering group, so the definition as 
homotopy classes of paths in SL(2) is all we have. However, we can find some useful coordinates on 
the covering group. We can extend ( |iTT5|) using an integer instead of a sign, leading to the coordinates 

g{v a ,z), v 2 >-l, zgZ, 

g(v a ,z)=g(-v a ,z+l) for v 2 = -1, v° > 0. (1.18) 

The projection onto SL(2) reads g(v a ,z) 1— > g(v a , (— ) z ). Another set of coordinates is provided by 
exponentiating the algebra. 

Exercise 1.7. Show that every element of SL(2) can be written in the form 

g = ±exp(n a 7a ), (1.19) 

but the spacelike sector of the right part in figure ^ cannot be "reached" if the sign is positive, i.e. 
there are group elements which are not exponentials of elements of the algebra. 

For the covering group we define 

exp(n a 7 a ) : s ■-> exp(sn a 7 a ), (1.20) 
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which provides the natural exponential map from the algebra into SL(2). Of course, the notation is 
only formal, as there are no matrices -y a such that this map is given by exponentiating them. We 
can only define it as a homotopy class of paths in SL(2). Observe that exp(27T7o) is the unity matrix 
but exp(27T7o) is not the unity of SL(2). Similar to ( p.,19| ) every element of SL(2) can be written as 

g = l z exp(n a 7 a ), where l z = exp (271-70) = gr(0, z). (1.21) 

The group element l z is the origin of part no. z in figure It is a central element of SL(2), i.e. it 
commutes with all other group elements, and lo = 1 is the unity. The 1 2 form the loop group of 
S0(1, 2), and the even elements I22 build up the loop group of SL(2). 

Exercise 1.8. There is also a global coordinate system on SL(2), which can be chosen to be 

g(w, x, y) = exp(>7i + yj 2 ) exp(-ury ), w,x,y £ R. (1.22) 

As a consequence, show that for any smooth field h : M — > SL(2) on some manifold, and its projection 
h : J\f —* SL(2), there exists a smooth deformation h s : J\f — > SL(2) such that h\ = h and ho = 1. 

This property of SL(2) valued fields will be important when defining gauge transformations: it will 
tell us whether two fields configurations are gauge-equivalent or not. 

Our coordinates can also be used to read off whether two group elements commute or not. Using 
( P-.15 ) and the Clifford algebra of the gamma matrices, we find that g(v a , ±) and g(w a , ±) commute 
if and only if the two vectors v a and w a are proportional to each other, and the same holds for SL(2): 



Exercise 1.9. Two elements g{v a , z), g{w a , z') G SL(2) commute if and only if 

e abc v a w b = ^ 3r,s eM:rv a = sw b . (1.23) 

Given an element v = v a -f a G sl(2), then there is the adjoint action of the group element u G SL(2) 
on v : 

v = v a ~f a 1 — ► u x vu = v a u a b j b , (1-24) 

which is an SO(l, 2) Lorentz rotation u a b acting on the vector v a G M 3 . Similarly, an inner automor- 
phism of the group SL(2) acts as a Lorentz rotation on the vector v a in figure ^ 

u~ 1 g(v a , ±)u = ± u~ 1 (t> a 7 a + v l)u 

= ± {v b u b a la + vl)= g(v b u b a , ±). (1.25) 

This transformation is obviously well defined for the covering group SL(2), too, as the length of v a 
is unaffected and we always stay in the same part of figure [2|. Hence, there is a well defined map 

SL(2) -> SL(2) : g ^ u l gu, where u G SL(2). (1.26) 

Note that you cannot use group multiplication to define this map, because you cannot multiply 
elements of SL(2) and SL(2) with each other. There are three alternative ways to define this map 
properly. 

Definition 1.13 (Automorphisms of SL(2)). For each u G SL(2) there is an inner automorphism of 
SL(2), which is either defined by 

u l gu = u x gu (1.27) 
for some u G SL(2) whose projection is u, or by 

u l gu : s 1 ^ u^ 1 g{s)u, (1-28) 
where s 1— > g(s) is a path representing g, or using the coordinates from above we can write 

u- 1 g(v a ,z)u = g(v b u b a ,z) (1.29) 
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Exercise 1.10. Show that the right hand sides of the first two definitions are independent of the 
special u, and of the special path g(s), respectively, that all three definitions are equivalent, and that 
these are all inner automorphisms of SL(2). 

Now that we have the Lorentz group it is not difficult to get the Poincare group, which is obtained 
by attaching the algebra to the group and define an appropriate multiplication. 

Definition 1.14 (Poincare Group). The Poincare group is given by IS0(1,2) = S0(l,2) x so(l,2) ~ 
SL(2) x sl(2), i.e. its elements are pairs (g,f), and the multiplication reads 

(g,f)(g',f) = (99', f + gf'g- 1 ), 

{9,fr l = {9-\-g- x fg)- (i-30) 

Exercise 1.11. Show that this is in fact the group ISO(l,2), i.e. the covering group of ISO(l,2) 
consisting of Lorentz rotations and translations of three vectors, and whose elements are 4x4 
matrices 




(1.31) 



which is the projection of (g, f) £ ISO(l, 2) onto ISO(l, 2) (remember that g a b £ SO(l, 2) is obtained 
from g by first projecting onto g G SL(2) and then using ( |1.16| ); f a is obtained by expanding 

/ !"!<,)■ 

This group is the full gauge group of dreibein gravity. However, we will not make much use of the fact 
that we can write it as a single group or single algebra. This is quite useful when considering only 
classical solutions of Einstein gravity in three dimensions. But when going over to Hamilton Jacobi 
formalism and then to quantum theory, we have to split the group into its components consisting 
of rotations and translations anyway. That's because the gauge fields corresponding to the different 
parts will become canonically conjugate quantities and non-commuting operators, respectively, so 
that it is not easy to keep the "one group" notation. In addition, the physical interpretation of the 
two sets of gauge fields is quite different, so to extract the physical content of various quantities 
becomes easier if we keep the "component notation" (g,f) for ISO(l,2). 

2 Kinematics 

The aim of this section is to present an unusual approach to general relativity. It will turn out 
to have two very different advantages. The first is rather technical: it will be possible to extend 
spacetime beyond the conical singularities arising in three dimensional gravity. As a consequence, 
the solutions are always globally defined and we do not have to exclude points from the spacetime 
manifold. Nevertheless spacetime will look as if it had singularity. All this will become possible after 
allowing certain kinds of singular metrics. 

Another feature of this approach is that in some sense we describe physics "from inside" , i.e. from 
the point of view of an observer sitting somewhere on the manifold. Associated with a local observer 
there will be a local three dimensional Minkowski space, which may be considered as his "laboratory" , 
where he can make experiments with test particles. All measurements, i.e. all physical entities, will 
be expressed as objects in this "local Lorentz frame". With such an observer at each point in space, 
gravity (or geometry) becomes the theory of their interaction or rather their communication. We 
will even go further and forget that the points on the "spacetime manifold" are events. Each point 
just labels an observer at a specific time. This may sound a little bit strange but we will see that 
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this interpretation is able to deal with exactly those singular metrics mentioned above, and "physics" 
does not change at the singularities. 

This approach to gravity is not restricted to three dimensions, except that only there it is possible 
to eliminate all singularities. Technically, what one has to do is to replace the metric by a vielbein 
and the spin connection as basic gravitational fields, then the Einstein Hilbert action is polynomial 
in both and well defined for singular vielbeins. Below we will do this for d = 3 and we will find a 
rather simple interpretation for the singular metrics: if there is a line of "length" zero (not to be 
confused with a lightlike line), or an area or volume of vanishing measure, this just means that all 
the observers associated with this line are actually at the same "physical event". The underlying 
manifold is no longer identical to the physical spacetime. Instead, spacetime is given by identifying 
those points of the manifold which have vanishing distance. 

2.1 Dreibein gravity as a theory of local observers 

One usually derives the dreibein formalism of gravity from the metric formalism by writing the metric 
as a function of the dreibein, introducing a spin connection via the dreibein postulate etc. That's not 
the way we will follow here. Our starting point is the "set of local observers" , which is a 2-manifold 
Af. As already mentioned, the framework presented here is rather general, but we restrict Af to have 
a trivial tangent bundle. Typical examples for such manifolds are two-spheres or tori with arbitrarily 
many holes (but at least one for a sphere) cut out. 

Now each point x £ Af shall corresponds to, or rather is, an individual local observer, who has a 
"laboratory" represented by a three dimensional Minkowski space M 3 . It is called the "local Lorentz 
frame" of x. Vectors in M 3 are denoted by v a , a = 0, 1, 2, and the Lorentz metric r] a b is used to raise 
and lower indices, which has been used already in section |i~2| . Every measurement an observer makes 
has to be related to a quantity in his local frame. In general, all these local observers together form 
an M 3 bundle over Af, but with our restriction above it is reasonable to assume that this is trivial 
too, so that the "flat" indices a, b, . . . are also global. As part of exercise |Ll] the reader may think 
about the strange geometry arising if this bundle is not trivial (or not isomorphic to the tangent 
bundle of spacetime, see below). 

So far we considered a two dimensional manifold which has something to do with "space" . Space- 
time is now obtained by introducing a "time evolution" of space. Hence, our interpretation of gravity 
as theory of local observers is in principle the same as the Hamiltonian or Arnowitt Deser Misner 
(ADM) formalism of gravity j|, except that not spacetime but the "space" Af is our basic back- 
ground structure with its interpretation as a set of observers. 

Using the language from above, what we do is the following. For each observer, we introduce a 
variable t called "time", but which does not have to do anything with the physical time. You can 
think of it as some "bad" clock, i.e. it shows an increasing real number t which can be used to label 
events happening in the laboratory of the observer, but it is not suitable to measure the physical 
time. "Increasing" of course means that there is a relation to the physical time r, namely that r is 
a monotonic function of t. As we will see later on, this can always be established by "gauge fixing". 
An event is now fixed by specifying where it happens, i.e. at which observer x £ Af, and at which 
time t £ R. Hence, we define spacetime to be Ai = Af x R. Our interpretation of space as a given 
set of observers requires the same restriction on spacetime as the ADM formulation: spacetime must 
be globally hyperbolic, in order to get a well defined initial value problem. In the spacetime picture, 
the individual observer x £ Af becomes a worldline x{t) = (x,t) £ Ai. By a gauge choice we will 
be able to make this a timelike path, so that the interpretation as real physical observers is always 
possible. 

The parameter time t can also be considered as a global time coordinate: we can look at the 
universe from the outside and what we see is a space Af, which is given by the set of all observers at 
a specific time t, and this space, i.e. the fields defined on Af, becomes time-dependent. The global 
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coordinate t will enter into the Hamilton Jacobi formalism as time variable: the action will become 
a "time" integral of some Lagrangian, which is a functional of the fields and their "velocities" . The 
Hamiltonian and Poisson brackets are defined such that they describe the evolution of the universe 
with respect to this parameter time. For the moment, however, we are only considering the kinematics 
of gravity, so yet we don't have actions, Hamiltonians and things like that. All we have so far is 
a topological structure, i.e. we know which observers are in a neighbourhood of a special observer. 
We have to introduce some fields to describe the "interaction" of these observers. In particular, an 
observer has to know how far away another observer is and how other local Lorentz frames are related 
to his own. It is exactly this what is described by the dreibein and the spin connection. 

In the following, we will sometimes switch between the "canonical" or ADM formalism, where 
the universe is described by fields on a space N evolving in time, and the "spacetime" point of view, 
where it is just given by the same fields on a manifold M = N x R. Tangent indices on M are 
denoted by k, . . ., and they can be considered as global indices, as we are assuming that N has 
a trivial tangent bundle. For tangent indices on M. we use small Greek letters fi,u,p,..., and they 
"take the values" t and i,j,k, — A covector or one-form on M splits into a scalar vt and another 
one-form Vi on jV, and similar relations hold for arbitrary tensors. As the first gravitational field the 
dreibein is now introduced as a field in M.: 

Definition 2.1 (Dreibein). The dreibein is a linear map from the tangent bundle of M into the 
observer bundle, or simply a matrix that maps a tangent vector v^{x) of M onto a vector v a (x) = 
v^(x)e fJi a (x) in the local Lorentz frame of the observer at x. The map is represented by a field e fJ a {x) 
which we only require to be a smooth function of x. 

No other restrictions are made, in particular, the matrix need not be invertible. In case of a more 
general manifold, e^ a exists locally only and it is possible to define an invertible dreibein only if the 
observer bundle and the tangent bundle are isomorphic, simply because then the dreibein itself is 
an isomorphism. Thus assuming the observer bundle to be trivial is natural if the tangent bundle is 
trivial, otherwise the geometry would become rather strange (see above). 

From the canonical point of view, the dreibein splits into a set of one-forms ei a (x,t) on J\f, a 
2x3 matrix called the "spatial dreibein", and a 3-vector et a (x, t). The physical interpretation of the 
dreibein is as follows. Given an observer at a specific time x with local coordinates x^ = (x l ,t) and 
another one with coordinates x^ + dx^. Then dx a = e^ a dx^ is a vector in the local frame of x that 
tells him in which direction and distance x + dx is. We call this vector the "distance" from x to 
x + da;. In particular, the length of the path from x to x + da; is given by the length of this vector: 

ds 2 = da: a da: a = g^dx^dx", where g^ = e^ej'r]^. (2.1) 

Restricting this consideration to a specific moment t = to, we also get a metric on N . The distance 
of the two observers at the same time is given by the spatial dreibein and reads 

ds 2 = dx l dx j gij, where gij = ei a ej b rj ab (2.2) 

is the induced metric on N as a submanifold of M defined by t = to. Note that this is not necessarily 
positive definite, and g^ u is in general not invertible, as there is no such restriction on the dreibein. 
The time component et a (x,t) describes the relation between the parameter time t used by x and his 
physical time r. The proper time elapsing when t increases by dt is 

dr 2 = - dt 2 efefvab = ~ dt 2 g t t- (2.3) 

For r to be well defined, et a must be a timelike vector, which is equivalent to requiring that the 
worldline on the observer is timelike. As et a will turn out to be a pure gauge degree of freedom, we 
could restrict its values to any open subset of M 3 (otherwise we might loose equations of motion). 
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However, we can equally well keep it arbitrary, and whenever we have a special configuration we 
can find a gauge transformation such that et a becomes timelike and we can read off the physical 
properties of that configuration, which are by definition invariant under gauge transformation. 

It is now straightforward to interpret singular dreibein fields. If e^ a has a zero eigenvalue one 
can find dx^ = (di, dx l ) such that dx a = 0. Using our interpretation from above this means that 
the distance between the two observers (possibly at different parameter time, if dt ^ 0) is zero, they 
actually sit at the same spacetime event. The point x £ A4 labels an observer at a given moment, not 
a point in spacetime. More precise, if dx a = 0, then the same physical event is seen by observer x at 
time t and observer x + dx at his time parameter t + dt: the two (or more) observers meet each other 
at this event. Physical spacetime is defined as the set of equivalence classes of points in A4, where 
two points are equivalent (i.e. two observers see the same event), if there exists a path connecting 
them such that dx a = everywhere on the path. The resulting spacetime metric is non-singular in 
the sense that two different events always have finite distance. Note that dx a = is different from 
ds 2 = 0, which states that a path is lightlike. Observe also, that now the topology of spacetime is 
no longer fixed: different dreibein fields and thus different equivalence classes may lead to different 
topologies. An illustrating example will be given below. 

Definition 2.2 (Spin Connection). As a second gravitational field we introduce the spin connection 
by defining a covariant derivative of vectors in the local Lorentz frames. It is represented by an 
so(l,2) gauge field = —uj^ba- Given a vector field v a , then the covariant derivative reads 

D/ = 5/+</ (2.4) 

We can expand ui^ in terms of the generators of the vector representation of the Lorentz algebra 



(see definition 1.11 ) to get 

The covariant derivative then becomes 

D^v a = d fl v a + u fiC e ca b v b . (2.6) 

Thus oj^a is the abstract Lie algebra notation for the spin connection (a = 0, 1, 2 labeling the genera- 
tors), whereas u)^ ab is its vector representation. The field strength or curvature of the spin connection 
is defined as the commutator of two covariant derivatives: 

D {ll D v] v a = \F^\v b . (2.7) 

It reads explicitly 

F^ivab = dfi^uab ~ dy^^iab + ^iia^vcb ~ ^va^^cbi (2-8) 

in the vector representation, and the abstract algebra representation is 

where the structure constants —£ a bc appear. A similar relation as for the spin connection itself holds: 

^[wab — &ab Ffiuc Ffxva — 2^ a ^fivbc- (2-10) 

Like the dreibein, the spin connection and its field strength split naturally into its space and time 
components, simply by inserting i,j or t for the indices fi, v. The spatial component describes how 
vectors in the local Lorentz frames of two observers can be compared. Consider vectors v a {x) and 
v a (x + dx) measured by different observers (at the same time t). The difference of these vectors, 
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measured in the local frame of x is given by dx l DiV a . Similarly, the difference of two vectors measured 
by the same observer x at different times, v a (t) and v a (t+dt) reads dt DfV a . We will give more precise 
definitions for how to compare measurements made by different observers or at different times below 
by introducing transport operators. However, as we already introduced all the fields, we can see what 
dreibein gravity is: the theory that describes how measurements made by different observers can be 
compared and what the physical distance between them is. 



2.2 Transport operators and geodesies 

In this section we will construct non-local objects as functions of the fields e^ a and ui^ab, describing 
concrete finite "measurements" made by our observers. In metric general relativity one defines a 
parallel transport operator as follows. Given a path a : [a,b] — > Ai. The transport operator maps a 
tangent vector v^{a{a)) at the beginning of the path onto another tangent vector at the end: 

v^(a(b)) = v u {a(a)) U a (a, b)^. (2.11) 

The matrix U a (a, b) v ^ £ GL(3) is defined such that the vector is covariantly constant along the path, 
i.e. its covariant derivative with respect to b vanishes. The full definition is 

U a (a, a) v ^ = 5^, 

V s U a (a, s)^ = ^ U a (a, s)^ + U a (a, 3) v T^ p ol a = 0. (2.12) 

Note that the two indices of U are tangent indices sitting at different points and transform differently 
under coordinate transformations, so that the covariant derivative acts on the index located at a(s) 
only. The notations here is as follows: if the path's tangent vector a'^ or a field appears in a product 
with such a transport operator, the ordering of the product indicates where the fields are to be taken; 
i.e. in ( |2.12| ) the tangent vector has to be taken at s and the Christoffel connection at a(s). 

A geodesic can now be defined as a path whose tangent vector is covariantly constant. We can 
write this as a local differential equation or as a non-local equation using the transport operator 

V a a'"(s) = o a'»{b) = a' v {a) U a (a, &)„". (2.13) 

More precise, it is sufficient that there is a reparameterization of the path such that this equation 
holds. This definition is equivalent to the "minimum length" or "maximum proper time" requirement 
(if no torsion is present), but it is more general as it applies to lightlike paths as well. Unfortunately, 
it is not suitable for our formalism, because we are dealing with local Lorentz vectors rather than 
tangent vectors of the manifold and also because the definition does not work for non-invertible 
metrics. However, as we introduced a connection as a basic field, we can define a transport operator 
with respect to this connection: 

Definition 2.3 (Transport operator). Given a path a : [a, b] — > Ai (always assumed to be smooth) we 
define a matrix U a (a,b) a b mapping a local Lorentz vector at a(a) onto one at a(b). It is uniquely 
given by 

U a (a,a) a b = 5 a b , 

^ U a (a, s) a b = U a (a, s) a c L0^ c b a^. (2.14) 

As u> pa b is an so(l,2) connection, this gives an SO(l,2) matrix, i.e. a proper Lorentz rotation. One 
can also write U as a path ordered integral 

U a (a,b)/ =Vexp ( t ds a'* u^A, (2.15) 

J a 
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where path ordering is defined such that factors nearer to the lower boundary of the integral appear 
to the left. We do not require any ordering like a < b for the parameters. Transport operators may 
be linked together and they are "symmetric": 

U a (a,b) a b U a (b,c) b c = U a (a,c) a c , U a (a,b) ab = U a (b,a) ba . (2.16) 

The second equation follows from the first if we set a = c and use the fact that the inverse of an 
S0(1, 2) matrix is given by its transpose, multiplied by the flat metric from both sides. In section 1.1 
we always used the interval [0, 1] as the range for s, so we introduce U a ,a as an abbreviation for 
U a (0, l) a b , and instead of pl6| ) we can write 

U a ,a U/3^b C = Ua,/3,a C , U^ab = U a -i ba . (2-17) 

How is this transport operator related to the metric operator U a (a,b) fJi u defined above? A priori 
there is no relation, because it depends on the spin connection only whereas the metric depends on 
the dreibein. The relation we want to establish is that, given a vector at the beginning of some 
path, parallel transport with respect to the metric and then mapping it onto a local Lorentz vector 
by the dreibein should give the same result as first mapping it onto a local Lorentz vector and then 
transporting with respect to the spin connection. This is the "dreibein postulate" , which can also be 
formulated locally and is then called "torsion equation": 

Exercise 2.1. Show that, if the dreibein is invertible, the non-local dreibein postulate is equivalent 
to the torsion equation: 

U a (a,b)^e u b = e ll a U a {a,b) a \ & D^erf = 0. (2.18) 

Here one has to use that the torsion equation actually states that the total covariant derivative of the 
dreibein vanishes (i.e. including the torsion-free Christoffel connection, but without antisymmetriza- 
tion). As the torsion equation is an equation of motion, i.e. it is part of the "dynamics" of dreibein 
gravity, it is ensured that our transport operators and geodesies coincide with those derived from the 
metric in regions where it is invertible. 

There are two important formulas describing how the transport operator changes under variation 
of the path or the spin connection, respectively. 

Exercise 2.2. The functional differential of the transport operator ( |2.15| ) with respect to the connec- 
tion reads 

rb 

5U a (a,b).-= dsU a (a,s).' a'^Su^/ U a (s,b).\ (2.19) 

J a 

Exercise 2.3. Under a smooth deformation a i— ► a + 5a of the path the variation of the transport 
operator is 

SU a (a, b) . ' = U a (a, b) . ' bo^oj^. ' — Sa^uj^. ' U a (a, b) . ' 

+ / dsU a (a,s).- 6a"a' u F^.-U a (s,b).\ (2.20) 

J a 

Here we replaced the flat indices of the transport operator again by the dots introduced in ( |1.8[ ) to 
show that these are general formulas that hold for every representation of the transport operator, 
we only have to replace the field strength and connection by the appropriate terms. The indices are 
arranged such that they multiply the so(l, 2) and SO(l, 2) valued fields as matrices. Remember also 
the convention that the points where the fields are to be taken are indicated by the factor ordering, 
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i.e. in the first term in ( 2.2C| ) we have to take them at a(b), at a(a) in the second and at a(s) inside 



the integral. We see that, if we fix the end points of the path, the variation is proportional to the 
field strength. 

We can now extend the definition of a geodesic to singular metrics. However, it is no longer 
possible to parameterize the path using its own "proper time" or "proper length" . 

Definition 2.4 (Geodesic). A path a is called a geodesic if there exists a vector c a at a (fixed) point 
a{a) and a function r(s) such that 

U a {a,s) a b eJ>a'» = T'(s)c a . (2.21) 

Hence, the tangent vector of a geodesic, projected into the local Lorentz frames, is covariantly 
constant up to rescaling. If the geodesic passes through a region where the dreibein is singular, r' 
might vanish, showing that the "proper length" does not increase: the path remains at the same 
spacetime event though it passes from one observer to another. In these regions with singular 
dreibeins the new geodesies behave quite strange but as we will see in the example below the definition 
still makes sense. If the dreibein is invertible, we obviously have r' 7^ whenever a'^ 7^ 0, and we 



can reparameterize such that r(s) = s. With ( 2.18; ) we infer that our definition is then equivalent to 
the metric definition of a geodesic given above. 



The vector c a appearing in ( 2.21 ) is something like the finite version of dx a introduced above to 
measure the "distance" of two observers in spacetime. To see what c a is, assume that a is a geodesic 
from a(0) = x to a(l) = y. We can parameterize it such that r(0) = and r(l) = 1. Then it is 
obvious that c a tells us how to get from x to y, namely by going in the direction indicated by c a (it 
is the tangent vector to the geodesic at x), and the distance is given by the length of c a , just as for 
the infinitesimal quantity dx a above. Hence, we may call c a the "distance" from x to y, measured 
in the local Lorentz frame at x. 



Given the geodesic, we can calculate c a by integrating ( 2.21 ) 



c a = f 1 dsU^sne^a'^. (2.22) 
Jo 

This expression is invariant under reparameterization of s, we just have to know the geodesic itself 
to compute the distance c a . In addition, the value of this integral also exists if a is not a geodesic. 
It defines another quantity associated with a path. Let us call it the "distance of the path a" 

r b 

V Q (a,b) a = dsU a (a,s) a b e^ b a^, (2.23) 

J a 

which is a vector in the local Lorentz frame at the beginning of the path. An alternative definition 
is, similar to ( |2.14| ), the differential equation with initial condition 

V a (a, a) a = 0, ^ V a {a, s) a = U a (a, s)\ e b a'». (2.24) 

As for U we use the abbreviation V a a = V a (0, l) a . We have the following relations for the link or 
inverse of paths: 

V a .f3 a = V a a + U a a b Vf3 b , V a li = -V a b U a , b a . (2.25) 

Note that in the first equation Vg b is a vector at /3(0) = a(l), and U a a b maps this vector onto a vector 
at a(0), which can be added to V a a . Together with (|2.17| ) this is exactly the group multiplication 



matrix, we can summarize their definitions as 



on ISO(l,2) as introduced in definition 1.14. If we arrange U and V such that they build a 4 x 4 

a'* (2.26) 



d_ (U a a c V a a \ = fU a % V a a 
ds \ 1 J V 1 
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The last matrix is the combined iso(l,2) connection. However, as already mentioned, we will not 
use it very much, as the physical interpretation of the two connection fields are quite different and 
the components are canonically conjugate phase space coordinates and non-commuting quantum 
operators. Finally, we also have to know how V transforms under deformations of the path. The 
formula is similar to that given for U. It will be sufficient to know how the transformation looks like 
if the field equations of gravity are satisfied. 

Exercise 2.4- Assume that the torsion equation holds and the spin connection is flat. Then under a 
smooth variation of the path 5a the transport operator and the distance change by 

6U a (a, b) a b = U a (a, b) a c 5a" ^ b - 5a" u m b U a (a, b) c \ 
5V a (a, bf = U a (a, bf b 5a" e b - 5a" V a (a, b) h + e/). (2.27) 

As indicated by the factor ordering the fields and 5a" are to be taken at a(b) in the first term and 
a(a) in the second term in both equations. 



2.3 Spinors, test particles, and measurements 

So far we considered paths and geodesies in spacetime. For a given path we defined a transport 
operator and a "distance" vector. We know want to discuss the physical meaning of these objects 
and how they can be measured. To make such measurements, our observers can use test particles. 
These test particles shall have the following properties. They may be scalar, spinor, or vector like 
particles, i.e. in a local Lorentz frame they are represented by a scalar (this is just a point particle 



with not extra structure), a spinor x as introduced in definition 1.11, or a vector v a . 

Any observer x can "emit" such a particle at a time t, it will then move along a geodesic to 
another observer y, where it arrives at time t'. The "state" of the particle, i.e. the vector v a or the 
spinor shall stay covariantly constant while the particle is moving. In addition, the observers can 
measure geodesic distances, the result of such a measurement being the vector c a appearing in ( |2.22| ). 
For physical reasons we should restrict the paths of test particles to be timelike or lightlike, but to 
simplify the treatment of measurements, let us also allow tachyons as test particles: 

Exercise 2.5. In principle, for all following the experiments there is a possible physical realization of 
another experiment without tachyons, giving the same result if the equations of motion are satisfied 
(i.e. the dreibein postulate must hold and the connection must be flat). The only reason for such an 
experiment to fail is that horizons might be present, so that, e.g., a real particle can never pass along 
a certain loop in space. Find out which of the spacetimes constructed in section [O] have horizons 
and how this affects the measurements described here. 

Consider a vector particle emitted in a state v a (a) at a(a). The observer at a(b) will see it in a state 

v b (b) = U a (b,a) b a v a (a). (2.28) 

Locally this can be written as 

D s v a = d s v a + a'"u]^ b v b = 0. (2.29) 

For a spinor particle we get exactly the same, we only have to replace the vector representation of the 
Lorentz algebra by the spinor representation. We denote the spinor representation of the connection 
and the covariant derivative of a spinor by 

^ = h^nal a , D^x = d^x + (2-30) 
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The corresponding representation of the field strength is given by the commutator of two covariant 
derivatives 

D [^ D v]X = \F^vX =>- Fy* = d^u - + U3 V \. (2.31) 
We define the spinor transport operator similar to ( 231 ), with u^. ' replaced by u;^: 



U a (a,a) = l, -^U a (a,s) = U a (a,s)^a'». (2.32) 
Hence, for a spinor particle moving along a path a we have 

D S X = X {b) = U a (b,a)x(a). (2.33) 

Exercise 2.6. Show that the spinor transport matrix obeys 

U a Up = U a . p , U- 1 = U a -i , (2.34) 

where again U a is an abbreviation for U a (0, 1), and that it is related to the vector transport operator 
by 

U a {a, b) ab = \ Tr( U a (b, a) la U a (a, b) lb ), (2.35) 

which is the projection SL(2) — > S0(l,2). From this relation we can infer that we only need spinors 
as test particles to extract all information about the transport operators. 

We can convert all our notation into the spinor representation by using the fact the three dimensional 
Minkowski space M 3 is isomorphic to the Lorentz algebra sl(2). Hence, every 3- vector can be 
converted into an sl(2) matrix by expanding it in terms of the gamma matrices. Doing this with the 
dreibein, we get the "curved" gamma matrices 

7 M = e/7a, e/ = iTr( 7M 7 a ), 3^ = ^(7^), (2.36) 

and the covariant derivative of an sl(2) valued fields is 

v = v a la , D^ = D^v a la => D M « = M « + [«„,«]. (2.37) 

Now the gravitational fields 7^ and uj^ both take values in sl(2). We can also define a matrix valued 
distance V a = V a a ~y a , so that the two basic non-local objects are 

U a = "Pexp / ds a' 11 u>„ , 
Jo 

V a = [ dsU a (0,s)^a^U a (s,0). (2.38) 
Jo 

We can even go further and introduce an SL(2) transport operator. However, this is not a matrix 
and does not correspond to any physical measurement with test particles. It is quite simple to define 
it as an equivalence class of paths in SL(2): 

U a (a,b) : U a (a,a + s(b - a)). (2.39) 
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Exercise 2.7. Show that the identities ( 2.34 ) hold for U as well. This is not as trivial as it seems to 



be: for the inverse you have to show that 

Ua 1 : s i ^ U(s, 0) ~ U a -i : s i-» U(l, 1 - s), (2.40) 

and for the link the following paths in SL(2) have to be homotopic: 

~ fu Q (0,2s) fors<i 

a,/3 ' S \ U a Up(Q, 2s - 1) for s > \ 

I 

U a Up : s ' * U a (0, s) Up(0, s). (2.41) 



The pair (U a ,V a ) is an element of ISO(l,2), and the relations ( 2.34 ) and ( 2.25 ) can be combined 
into 

(U a .p, V a . p ) = (U a , V a ) {Up, Vp), 

(U a -i,V a -i) = (U a ,V a y\ (2.42) 

with the multiplication introduced in ( |1.30|) . The V component of this relation reads explicitly 

V a .p = V a + U a VpUa\ V a -i = -U« x V a U a . (2.43) 

Let us now briefly describe how the values of these objects can be obtained by concrete measurements. 
Anticipating the results from section || let us assume that the fields are solutions of the equations of 
motion, which means that the torsion equation holds and that the field strength of the spin connection 
vanishes. From ( 2.27| ) we infer that the value of the transport operator or the distance then depends 



on the homotopy class of the path only. 

It is quite easy to measure U a and V a if there is a geodesic homotopic to a. In this case one 
can simply send a spinor (or rather two independent spinors) along that geodesic to measure the 
transport operator U a , and V a is the geodesic distance as defined in ( |2.22| ), converted into an sl(2) 
matrix. This also means that if there is more than one geodesic within the same homotopy class, 
then they have the same distance vector c a and, provided that the metric is invertible, they are in 
fact identical up to reparameterization. This is nothing but the trivial fact that in flat space there 
is only one geodesic connecting two points. However, there might be different geodesies in different 
homotopy classes, if spacetime is not globally Minkowskian. On the other hand, not every homotopy 
class contains a geodesic. 

Exercise 2.8. To measure U a or V a for a path such that there is no geodesic homotopic to a, one 
can use a piecewise geodesic path homotopic to a and compute the final result by ( 2.42Q . How can 
such a measurement be realized? 

Hence, we are now able to measure the transport operator and the distance for all (homotopy classes 
of) paths. Returning to the ADM picture, we can distinguish between paths that lie entirely in M and 
those in time direction. For the former, the transport operator depends on the spatial components 
Ui only, and we can say that a measurement (passing a test particle from x to y) takes place at 
the time t (this needs a tachyon, but see exercise ^^). This is what we need for quantum physics: 
measurements that are carried out at a fixed time. So let us stick to paths in M in the following. For 
path in M all the formulas above remain unchanged except that we have to replace the index [i by i, 
i.e the tangent vector of the path becomes a h etc., and all objects become time dependent. By these 
measurements we cannot get information about the time components -ft an d So let us assume 
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that an observer x 6 M is also able to measure the quantities u^(x, i) and 7t(x, i) at any time t, and 
we can define a transport operator and distance in time direction 

U x (ti,t 2 )=Vexp dtu>t(x,t), 
Jti 

V x (t u t 2 )= [ ' dtU x (t 1 ,t)'y t {x,t)U x (t,ti). (2.44) 

Obviously, tells x how to compare measurements made at different times, as it transports a spinor 
along the worldline of x. As already mentioned, the time components uit and 74 will turn out to be 
pure gauge degrees of freedom, so actually x can not only measure them, he can adjust them as he 
likes. 

Exercise 2.9. Show that with ut = and 74 = 7o the observer becomes free falling, his local Lorentz 
frame is co- moving (i.e. it is his local restframe and does not rotate), and the time coordinate t 
represents his proper physical time. It is U x (t\,t 2 ) = 1, and V x (ti,t 2 ) = (t 2 — ii)7o behaves like a 
clock. 



2.4 Singular metrics and conical singularities 

In this section we consider a simple example to show how the typical singularities of three dimensional 
gravity are described in dreibein formalism. We start with a flat Minkowski spacetime with polar 
coordinates t, r, (p. Outside the origin the dreibein and spin connection satisfying the torsion equations 
are given by 

e t ° = l, e r l = l, e/ = r, = 1, (2.45) 

all other components vanishing. Alternatively, we can write it in spinor notation as 

It = 7o, 7r = 7i, 14> = r 72, tity = 570. (2.46) 



In the sense of definition [2J this is not really a dreibein, as it is not defined for r = as a map from 
the tangent space of the manifold into M 3 . But we can change the topology of our space to make 
it well defined. Instead of having a Minkowski space with polar coordinates, we choose the space 
manifold to be a cylinder J\f = R x S , i.e. we let r take any real value and do not identify points 
with different angles 4> at r = 0. The only identification is <fr = (ft + 2ir. Thus the "origin" of space 
becomes a circle and space extends "behind" this origin. One can check that ( gig ) is a well defined 
dreibein and spin connection on this manifold and that the torsion equation is satisfied everywhere. 
The dreibein is singular at r = and with our interpretation from above this means that all observers 
with r = but <p arbitrary sit at the same physical point. The topology of space (and spacetime) is 
not N x R but space consists of two "cones" (with vanishing deficit angle) glued together at the top. 

We also immediately see that the curvature or field strength vanishes everywhere and thus the 
transport operator along a loop depends on its homotopy class (which measures how often the path is 
wrapped around the S l ) only. Given a path a(s) = (t(s),r(s), <f>(s)), solving the differential equation 
(|IJ) yields 

no ox 

U a (a,b).'= cos Acj) -sin AH, A(j) = (j)(b) - <j){a) . (2.47) 
\ sin A(f> cos A<j> ) 

In fact, here it actually does not depend on the homotopy of the path, as adding 2ir to the angle 
does not change U. However, transporting a spinor once around the circle changes its sign, as the 
spinor transport operator is 

rT , n , lA , N / cos(A/>/2) sin(A/>/2)\ , n 

U a (a, b) = exp ( 1^70 ) = ( _ ) ■ (2.48) 
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How does a geodesic on this manifold look like? Of course, in the two regions r < and r > 
spacetime is just ordinary Minkowski space and geodesies which entirely lie in these regions are 
simply "straight lines". A possible parameterization is 



— / TS 

t(s)=t + ts, r(s) = ±Jr 2 l +f 2 s 2 , <f>(s) = O + arctan — , (2.49) 

v r 

where (toj^Oj^o) is the nearest point to the origin and the ratio f : i gives the "velocity". Next 
consider a path that lies entirely inside r = 0: 

Exercise 2.10. Show that for a path a{s) = (t(s),0,(j)(s)) we have 

U a (a, s) a b e^ b c/ M = t'(s)c a , where c a = (1, 0, 0), (2.50) 
i.e. it is a timelike geodesic with r(s) = t(s) as "proper time". 

We see that a geodesic is no longer determined by giving a start point and a tangent vector. However, 
the definition still makes sense, because all these paths on the manifold N x K represent the same path 
in physical spacetime. This was defined as the equivalence classes of points in Ai having vanishing 
distance, and we saw that (t, 0, 4>) and (t, 0, 4>') have vanishing distance and thus all the paths above 
represent the "physical" geodesic t = t(s),r = 0. 

Finally we have to consider geodesies that pass through r = 0. For such a path we must have 
4>' = whenever r/0 (otherwise it would be of the type ( 2.49Q ). It therefore consists of three parts, 
the first and third being radial lines 

t = t- + (s — s_)t_, r=(s — S-)f_, (/> = </>_ for s < s_, 

t = t + + (s — s+)t+, r = (s — s+)f + , (f) = 4>+ f° r s > s +i (2.51) 

and in between we take an arbitrary path inside the r = circle as an ansatz: 

t = t(s), r = 0, <j) = 4>(s) for s_ < s < s+. (2.52) 

Mapping the tangent vector of this path into the local Lorentz frames gives 

(£_, f_, 0) for s < s_, 
y = { (t'(s),0,0) for s_<s<s+, (2.53) 
(t + ,f + ,0) for s > s + . 



Exercise 2.11. Now you can show that the following conditions must hold for these three parts to fit 
together and to build a single geodesic: 

1. t'(s) = in the middle part and thus t+ = i_; 

2. the path can be reparameterized such that t + = t_; 

3. either <p + — = (mod 27r) and f + = f_, 

or + — (/>_= 7r (mod 27r) and f + = — f_. 

The first statement says that the path, as seen from outside, "does not spend time" at r = even 
though the parameter might increase from s_ to s+. From the second we can infer that r = s — s_, 
r = 0, r = s — respectively for the three parts, is a possible proper length parameter for the path. 
Finally the third statements tells us how the two "outside parts" fit together: the first alternative 
gives a line that disappears from one region being continued in the other region where r has opposite 
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sign. The second one is a path that entirely lies within, e.g., r > and is just a usual straight line 
through the origin, where the angular coordinate changes by 7T. 

It is quite surprising that this comes out here, as our manifold is not Minkowski space but a 
"cylinder" with a singular metric. On M the geodesic through r = looks rather strange: It enters 
r = at some angle </>, then winding around the S 1 some integer or half integer times, and then it 
leaves r = either into the region it came from (for "half integer" ) or into the other asymptotic region 
(for "integer" ) . So this example shows that our definition of a geodesic is a reasonable extension to 
singular metrics. 

Now all this might be amazing but, beside a strange extension of Minkowski space beyond r = 0, 
what is it good for? The answer is that with the same trick we can extend spacetime beyond a 
conical singularity. All we have to do is to change the values of the fields slightly. Instead of ( 2.46| ) 
we choose 

It = 7o, 7r = 7i) l<f> = (r~f2, o^ = ±C7o, (2.54) 

with a real constant £ > 0. The torsion equation is still satisfied and the curvature vanishes, but the 
transport operator 

/I \ 

U a (a,b)/= cos(CA/>) -sin(CA£) , (2.55) 
\0 sin(C^) cos{( Ac/)) J 

is no longer independent of the homotopy class of the path. Changing A<p by 2tt gives a different 
value for U a . The formula ( 2.49| ) for a geodesic outside the singularity does not change very much. 



One just has to replace the arctan by l/£arctan. This leads, e.g., to the fact that for £ < 1/2 the 
geodesic can intersect itself. 



Also exercise 2.11 still holds, except that "mod 27r" has to be replaced by "mod 27r/£". This 
changes the behaviour of the geodesic at r = drastically. It says that the path enters r = 
somewhere, goes around by an integer or half integer multiple of 2ir/( then going back or into the 
opposite region. However, going 2ir/( further before leaving r = will result in a different direction 
of the radial line: the geodesic is "scattered" at r = in any direction A<p = 2irz/( (mod 2ir), 
z£Z. There are only finitely many such directions if £ is a rational, otherwise there is a dense set 



of possible directions. Here our geodesies are very different from those defined by (2.13) or by the 
extremal length condition. The first type is not well defined at r = 0. Paths with extremal proper 
length are also scattered at r = 0, but they exist for £ > 1 only (otherwise it is always "shorter" to 
avoid the singularity) , and instead of a discrete set of possible directions there is a continuous range 
around A<p = it. 

So after all we have learned that it is possible to introduce singular metrics into three dimensional 
gravity such that at a conical singularity physics does not change in the sense that we do not have 
to give up the interpretation of the dreibein and connection field. Our example showed the simplest 
kind of such a singularity, in the next section we will find other singularities with more interesting 
structure. 



3 Classical Dynamics 

The dynamics of gravity is of course determined by the Einstein Hilbert action. In dreibein formalism 
it can be most easily expressed in terms of the field strength of the spin connection introduced in 
(P>D. It reads 

I=\Jd?xe^e^F vpa . (3.1) 
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Alternatively, we can express it in terms of the spinor representation and the curved gamma matrices 
= e p a ~fa- Remember that for the spin connection there is an extra factor of one half in the 
corresponding relation = ^uj fia 'j a , so that 



| /d^e^TY^i^). (3.2) 



The Levi Civita tensor density e pvp is defined such that the dreibein determinant e = det(e /J a ) obeys 
the following relations: 

6£ — £ p ^Gfj, &u &p i 6 £p,up — ^abc^p, &v &p ■ (^"^) 

The components obey e pvp = —£p U p (i-e- A4 has negative signature). For the coordinates of our 
cylinder manifold we have e tr ^ = +1. We will almost immediately go over to the ADM formulation, 
where we have a Lagrangian on M instead of the action on M = J\f x R. All we want to do in this 
A4-framework is to derive the equations of motion and have a brief look at the gauge symmetries 
of the action. Thereby we will ignore all the problems that might arise from ill-defined integrals or 
boundary terms produced by partial integration. We will deal with these problems in detail below. 
If we just vary the action with respect to the fields we get the equations 

D [p e v\ a = d\ifiv\ - £ab c^[pb e u\ = 0) 

\Fpva = 9[n^>v]a — \ £ a C ^ph^>vc = 0, (3.4) 

D lfiM = d [plu\ + [<*fy,7i/]] = 0) 
\F^ U = dfrU v] + fafyaty = 0. (3.5) 

The first is obviously the torsion equation. We already saw that its consequence is that our definitions 
of geodesies and transport operators coincide with those in the metric formulation. The second 
equation than states that spacetime is flat. So we do not get any local degrees of freedom like 
gravitational waves. All the physical properties of solutions to these equations have a "global" 
character, they are encoded in non-local quantities like those constructed in the previous section. The 
aim of this section is to identify these quantities and to give a classification of the set of physically 
inequivalent solutions. If you are not familiar with dreibein formalism, you should check that the 
model here is in fact equivalent to the metric description of three dimensional gravity pO, p|, 22 , p3| : 



or in the spinor representation 



Exercise 3.1. Show that, if the dreibein is invertible, the torsion equation can be solved for the 
spin connection uniquely. Inserting this solution into the field strength yields the Riemann tensor 
Rpupa = FpuabGp a e a b ^ and the action becomes the Einstein Hilbert action ~gR[g\. 

The action also has gauge degrees of freedom, and in dreibein gravity these are the local Lorentz 
rotations and diffeomorphisms. In our language introduced in section |2[ they directly correspond to 
Einstein's relativity principles: For every observer the special relativity principle holds, i.e. physics 
looks the same in any local frame, as long as they are related by a Lorentz rotation, and general 
relativity means that physics looks the same for all observers and therefore interchanging observers 
(that's what diffeomorphisms do) does not change physics. 

The local Lorentz symmetry is realized by transforming the fields as 

Suj^ = L [A] = D M A, ^ = I [A] 7 M = [7^, A], (3.6) 
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where A is an si(2) valued field on A4. To be more precise, at this point we only found a symmetry. 
Let us postpone the exact definition of what is a gauge symmetry. It will become clear in the 
Hamilton Jacobi formalism below. 

Here L is introduced as a linear operator acting on fields. Diffeomorphisms are as usual generated 
by the Lie derivative of the fields. However, there is another symmetry of the action which is more 
basic than the diffeomorphisms and is also more suitable for our purpose: it treats 7^ as a gauge field 
just as is treated as a gauge field of the Lorentz symmetry. We call the symmetry transformation 
a translation. It also has an sl(2) field n as parameter: 

5^ = H[n] u)^ = 0, = H[n] 7 M = D^n. (3.7) 

To show that this is a symmetry of the action, one has to use the Bianchi identity D\^F vp \ = after a 
partial integration. The diffeomorphisms generated by the Lie derivative with respect to some vector 
field can now be obtained by combining these two symmetries: 

Exercise 3.2. Show that, given any vector field on A4, one can choose (field dependent) parameters 
n and A such that L [A] + H[n] acts as the Lie derivative with respect to this vector field. 

If there are no more gauge symmetries, then they should form a closed Lie algebra: 

Exercise 3.3. The operators L[X] and H[n] provide a representation of the local Poincare algebra 
iso(l,2) , i.e. their commutators read 

[L[X l ],L[X 2 ]) = L[[X 1 ,X 2 ]}, 
[L[X},H[n}]=H[[X,n]}, [H[m], H[n 2 }} = 0. (3.8) 

3.1 Lagrangian and boundary conditions 

So far we considered the classical equations of motion. To go over to quantum theory we want to 
reformulate the theory in Hamilton Jacobi formalism and then apply the Dirac canonical quantization 
method. All the necessary structure for this purpose has already be introduced. Spacetime is given 
by Ai = J\f x R, hence we have a global time coordinate which can be used as the canonical time. 
We just have to split off the integration over the real number t from the action integral ( |3.2| ) to get 
the Lagrangian. 

This should yield the same equations of motion, but now some of them become time evolution 
equations for the fields, others do not contain time derivatives. These are "constraints", i.e. they 
restrict the initial conditions we may choose for the remaining evolution equations. What we want to 
do in the following is to provide a well defined Lagrangian, compute momenta and Poisson brackets 
in the usual way and end up with a set of constraints and a Hamiltonian, which generates the correct 
time evolution by taking Poisson brackets. Even the first step will turn out not to be easy, because 
we want to allow our set of observers M to be non-compact like the cylinder in the last section. After 
some redefinitions of fields we will be able to compactify J\T, but this manifold will have boundaries. 
We have to introduce boundary conditions on the fields to get a finite Lagrangian and we will see 
that it is also necessary to introduce some extra degrees of freedom to get the correct equations of 
motion on these boundaries, which correspond to "spatial infinity". 

Let us first try to get the Lagrangian directly from the action by writing it as a time integral of 
some functional L. As this becomes an integral over the set of observers jV, we should also split the 
one-forms u>«, 7^ on A4 into the one-forms u^, 7i and scalars Uft, 74 on J\f, as already described in 
section |2|. The derivatives with respect to the coordinate t become "velocities", denoted by a dot. 
We also need the Levi Civita tensor on M, which is given by £ ^ — £ Or £{j — ^tij 

(giving M a 
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positive signature in contrast to the negative signature of M). Using all this we end up with the 
following Lagrangian: 



L = Jd 2 x e lj Tr(^ 7i + u t D aj + ht F v), I = J dt L 



(3.9) 



To obtain this L we performed a partial integration. This leads us to a problem which has been 
ignored up to now, namely that of the boundary conditions for the fields. It does, of course, not occur 
if space is compact without boundary, and that's the reason why most investigations on canonical 
gravity in three dimensions focussed on the torus, as this is the simplest non-trivial example for such 
a space. 

We divide the boundary problem into two sub-problems. The first is to get a finite Lagrangian, 
the second will be to get the correct equations of motion by adding boundary terms. To get a finite 
action, we somehow have to restrict the asymptotic behaviour of the the fields. Let us assume that 
the manifold M has a finite number of non-compact "ends" , which asymptotically look like cylinders 
(or rather cones, but at the topological level this is the same as a cylinders), i.e. M consists of some 
compact manifold, from which a number of "tubes" extend to infinity. The simplest example for 



such a manifold is the cylinder introduced in section with two ends, but you can easily imagine a 
more complicated manifold consisting of a sphere with N > 2 tubes going off, or even a torus with 
some of these tubes. Note that all these manifolds have trivial tangent bundles, as long as the genus 
of the "compact part" of M is not greater than one. 

Typical boundary conditions in field theory are given as fall-off conditions, i.e. the fields have 
to vanish suitably fast at infinity. In gravity such a condition does not work appropriately, because 
vanishing of the fields means that the metric becomes singular. That's not what we want. We want 
spacetime to be Minkowskian at infinity. So we have to look for different conditions. A first guess is 
to require that the difference of the fields to some set of background fields must vanish suitably fast. 



However, looking at the example considered in section |2.4| , we immediately see that even this is not 
possible. The fields, especially 7^ at r — ► 00, differ by an infinite amount for different values of the 
parameter £. So if we want to have all these solutions included, we cannot restrict the asymptotic 
behaviour in this way. 

On the other hand, in contrast to other field theories, there is an advantage of gravity here: The 
coordinate system is arbitrary. This means that, given such an asymptotic end, instead of choosing 
"local" coordinates r, <j> with = 0+27T and r G [tq, 00), we may as well take another radial coordinate 
p, which is bounded, p £ [po, Poo), and which is related to r by r = r(p) or p = p(r). A typical choice 
could be p(r) = arctanr. Using the p, 4> coordinate system, the integral in ( |3,9| ) runs over a finite 
range only, making the boundary somehow simpler. We can even compactify the manifold by taking 
p G [po, Poo]- The "ends" of AT then look like the boundary considered in figure [p. From the physical 
point of view, what we did is to add some "observers at infinity" to jV. They will play a special role 
below: there will be extra fields defined for those observers only, and they will somehow "glue" a 
fixed background Lorentz frame to the boundary of N . You can think of this as some kind of "fixed 
stars background" , and the extra degrees of freedom to be introduced are the relative positions and 
orientations of the observers with respect to this background. It will be necessary to introduce this 
extra structure to get the correct equations of motion from the Lagrangian. 

However, let us return to the problem of making the Lagrangian finite. The fields may still diverge 
at p — > poo, so we have to impose boundary conditions on them. Let us try the simplest possible 
condition: we require that all the fields can be smoothly extended to the compactified manifold, i.e. 
to p = Poq. Then the Lagrangian is certainly finite, as it is an integral over smooth functions on 
a compact manifold, which from now on will be called N . Let's see whether these are reasonable 
physical boundary conditions. Transferring the spin connection from the p-system to the r-system 
we get the following limits for r — > 00 (note the extra factor dp/dr appearing when transforming the 
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cotangent index p into r): 



u t (r) -» a%, cj (r) -» a; r (r) = p' (r)<jj p {p(r)) -> 0. (3.10) 

The "bared" quantities are those at p = p^, which may depend on <j> (and t, of course), and "— >" 
denotes convergence at r — > oo at least as fast as p'(r) converges to 0: 

/(r)-/oo ^ r hm (/(O-ZooVpV) exists. (3.11) 

These conditions are in fact reasonable, as they just require a gauge choice: 



Exercise 3.4- You can always perform a Lorentz gauge transformation generated by (|3.6| ) such that 
in the r-system we have u r — > 0. The remaining conditions, stating that the limits for and u>t 
exist, then follow by requiring F r t — > and F r ^ — ► 0, which does not impose any further restrictions 
on the solutions of the equations of motion. 

What about the dreibein field 7^? The boundary conditions for them look exactly like those in 
( |3,10| ), but this is certainly not what we want: 7 r —* would mean that the radial lines become lines 



of proper length zero, and 7^ — > 7^ is obviously not satisfied by the example in section 2.4. What 
seems to be missing is a "power of r" , i.e. we want ~f r to approach a constant and 7^ to grow linearly 
with r, so that the end looks like a cone. The trick to achieve this works as follows. The physical 
dreibein field is not 7 M but given by 

% = In + Dtfoo, (3.12) 

where 700 is some sl(2) valued field, which we have to adjust such that we get the correct boundary 
conditions. Note that this is just a transformation of the form (|3.7|), so 7^ and 7^ obey the same 
equations of motion and ([T^) is still the correct action, though 7^ appearing therein is not the 
physical dreibein field. We choose 7oo such that it diverges at the boundary and thus 7^ does not 
obey the same boundary conditions as 7^. What we want is that the radial lines become spacelike 
geodesies with r the proper length of the line, which means g rr = \ Tr(7 r 7 r ) — ► 1 and A-tv — ► 0. 
Using the boundary conditions on u)a, this leads to the requirement 7 r — > 7, where 7 is some (possibly 
^-dependent) matrix such that ^Tr(77) = 1. 

To achieve this, we just have to set ~foo(r, (f>) = r*y(<f)) for large r and smoothly falling off to when 
going towards the interior of J\f. The boundary conditions on the physical dreibein are obtained by 
replacing with 7^ in ( 3.10| ) and then inserting ( [3.12 ): 



7r^7, f yt-r[v t ,i\->'yt, % ~ rd^j - r[u}^,i\ -> 7^. (3.13) 

They look rather awkward but you can see that now 7 r becomes a constant and 7^ becomes linear 
in r. If expressed in terms of the fields 7^ instead of 7^, however, they are as simple as ( [3.10 ) and 



they are even simpler in the p system, where all the fields just have to be smooth at the boundary of 
J\f . We always assume that the fixed field 7oo 

which provides the relation (|3.12| ), does not depend 



on t. This is in principle possible but could cause some trouble with an explicitly time-dependent 
Hamiltonian. 



Exercise 3.5. Show that under the conditions ( 3.13j ) the radial lines asymptotically become geodesies, 



and that the last two conditions follow from the first one by requiring the components D\ r ~it} an d 
Dtj-^fM of the torsion to vanish for r — > 00, so that they do not impose additional conditions on the 
solutions. 

So after all we got a finite action in terms of the fields a^, 7^ on a compactified manifold M, which are 
related to the physical fields existing on the non-compact interior of TV only by ( 3.12| ). We can now 
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forget how this relation was constructed using local coordinates. The relation between the "physical" 
fields 7^ and the "finite" fields (which appear in the Lagrangian and obey the boundary condition 
"well defined on the compactified M") is entirely described by the sl(2) valued field 700, which must 
diverge to spatial infinity, i.e. Tr (700700) — ► +00, at the boundaries of J\f. Our background structure 
is a compact manifold M with some boundaries looking like figure |], and a fixed field 700 describing 
how the fields on M are related to the physical fields on the non-compact interior of N . 

Exercise 3. 6. Given a path a in jV, then the "physical" distance V a is related to the "finite" distance 
V a by 

V a = V a + E/ a7oo (a(l)) U- 1 - 7oo(«(0)), (3.14) 
where the finite and physical distance is obtained by (2.38) with 7^ and 7^ inserted, respectively. 



Hence, the finite distance of a path can be obtained by measuring the physical distance and "sub- 
tracting" the 7oo field (which has a fixed value for every observer). As V a has a finite limit if the 
path approaches the boundary, you can also measure the "distance" of the observers at infinity. 

Let us now investigate the dynamics induced by the finite Lagrangian L. What we want is, of course, 
to get the equations of motion ([3l]) back. It is easy to read off the canonical variables and Lagrange 
multipliers, because the Lagrangian is given in the first order standard form L = qp — H, where 
q are the configuration variables u>j, p the conjugate momenta £ lJ yj, and H is the Hamiltonian, 
depending on the phase space variables and the multipliers ufy, 7^, whose velocities do not enter 
the Lagrangian. We could introduce Poisson brackets right here, but we postpone it until we have 
our final Lagrangian. Let us first check whether L really provides the correct equations of motion. 
Variation of the Lagrangian with respect to the multipliers yields the constraint equations 

e^Daj ~ 0, \e ij F l3 « 0, (3.15) 

corresponding to the spatial components of the equations of motion (|3.5|). They define a subset of the 
phase space, which is called "physical phase space" or "constraint surface" . We introduce the usual 
notation "~" for a "weak" equality, i.e. a relation between phase space functions which becomes an 



equality on the physical phase space. Hence, ( 3.15 ) is just a definition of the symbol 



To simplify the notation, we derive the remaining Euler Lagrange equations as follows: if we vary 
a field whose time derivative enters the Lagrangian, we write 

6m tf] = *± 54> + S 4 = & - 1 S ) to (3.16) 
H S4> K H dt 5(f) ' 

which means that in all these variations we integrate the time derivative by parts if it acts on the 
variation of the field, so that we can directly read off the Euler Lagrange equation. With spatial 
derivatives and partial integrations, however, we have to be more careful, because the space M has 
boundaries, and the boundary conditions do not require the fields to be fixed there. 
If we now vary 74 , we find 

5L = J d 2 x e ij Tr(u^7,- + u%A*Yi) 

= J d 2 x & Tr(F ti 5 7i ) + E / d< ^ T r(^70)- (3-17) 

Here we picked up a boundary term after a partial integration, using Stokes' theorem 

J d 2 x e ij diXj = j> tyXfa ( 3 - 18 ) 
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where <j> is the periodic coordinate on the boundary. The sum runs over all boundaries of N . They 
all have to be parameterized in the same direction, and the Levi Civita tensor has to be normalized 
such that e p ^ = +1, where p is the local coordinate pointing towards the boundary. Note that the 
compactification of M now makes it very simple to deal with such integrals, as all the fields are well 
defined on the boundary, where the right hand side of ( |3.18 ) is evaluated. The symbol § denotes 



integration over any range 4>o to <fto + 2ir. The fields u t , 7^, etc., are considered as periodic functions 
of 6. 



Unfortunately, we do not get the correct equations of motion. Requiring (3.17) to vanish for all 
5~fj gives the correct equation Fu = 0, but in addition we get = on the boundary. A similar 
problem arises for the variation of the spin connection, where we have 

5L = J d 2 x e ij Tr( - Su^y, + uti^Jj] + 7tA<V?) 

= J d 2 x e ij Tr((A7i - A7t) <H) + E / ^ Mlt^). (3.19) 



Here we integrated the time derivative by parts as described in ( |3.16 ). Again, we get the correct 



time components of the torsion equation D^jq = 0, but we also get jt = on the boundary. What 
would it mean if the values of the multipliers are fixed at the boundary? The freedom to choose the 
values of the multipliers corresponds to gauge symmetries. A consequence would be that we loose 
some gauge degrees of freedom. This would not be too bad, but to maintain the correct number of 
physically equivalent states, we had to "fix the gauge" somehow on the boundary. 

Exercise 3. 7. An alternative way to deal with boundary conditions is the following: find a suitable 
gauge fixing for the fields 7^ and on the boundaries such that together with u t = and 7< = 
you get the same final result for the reduced phase space. You have to make the 700 field time 
dependent to get a non-singular metric at infinity. However, this is rather cumbersome and at least 
as complicated as introducing the auxiliary fields below. 

We can try to avoid these problems be adding the extra term 

L' = -Y,i*4> iHTtw* + "ti<t>) ( 3 - 2 °) 
1 J 

to the Lagrangian. But then the equations of motion for the multipliers uit and 7^ are no longer 



given by ( 3.15 ); we get extra constraints uty = and 7^ = at the boundary. This is just the kind of 
boundary condition we rejected above, because the simple example from section did not satisfy 
them. The only way out of this dilemma is to introduce extra auxiliary fields on the boundary. They 
look rather unmotivated first, but as already mentioned we will find a nice physical interpretation 
for them in the end. 

For each boundary of J\f we introduce a pair of fields u{4>) £ SL(2) and v(cf>) S sl(2). They are 
not periodic in (f>, so consider them as fields on a real line instead of a circle, or as fields on the 
boundary of M as defined by the family of paths in figure [l], instead of fields on the boundary of Af. 
However, they are not completely arbitrary. We require them to be quasiperiodic (see definition |L~g| ), 
i.e the pair (u,v), considered as an ISO(l,2) valued field, obeys the relation 

(u, v)(<j) + 2tt) = (u e , v e ) (u, v)(ct>), (3.21) 

with the multiplication as given by ( |1.30|) . Written out in components, the relation becomes 

u((j) + 2tt) = ugu(<fi), v((f> + 2ir) = V£ + U£v(4>) u^ 1 . (3.22) 

Quasiperiodic fields of this type will play a very important role below. They are used to classify the 
complete solution of the constraint equations. By taking derivatives, there is an alternative definition 
for quasiperiodic fields: 
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Exercise 3.8. Show that ( 3.22| ) is equivalent to the requirement that the following s 1(2) valued deriva- 



tives of u and v are periodic: 

u} (f> = u~ 1 d ( j ) u, % = u~ 1 d (j ,vu. (3.23) 

From these relations we infer that u (as well as v) has one degree of freedom more than a periodic 
field, which is somehow encoded in the integration constant for the differential equation u~ 1 d ( pu = uj^ 
(or the corresponding relation for v). You can also consider the constant ui (or V() as this extra 
degree of freedom. 

To get the correct equations of motion for the dreibein and the spin connection, we now add a 



term to L which looks very much like ( 3.20 ): 



L' = W d</> Tt(7t(% " + ~ l<t>)) (3-24) 



If we now vary cjj or 7$, the boundary terms in ( 3.1 7| ) and ( 3.19| ) cancel against the variation of L'. 



What we get is the correct time evolution for the dreibein and the spin connection 

Ui = DiUft, 7i = Dat ~ [wt,7»]. (3.25) 

Variation of the multipliers no longer yields u>^ = 7^ = on the boundary, as we found for (jjT2C|) . 
Instead, we get new constraint equations relating the boundary fields u, v to the connection and 
dreibein field tangent to the boundary: 



u 



l d<j ) u = u^, u 1 d 4> v u = 70 ~ J<f>- (3.26) 



This does not impose any new constraints on u>j or 74, because the differential equations are globally 
integrable for any arbitrary pair of periodic fields and 7^,. However, it seems that we only shifted 
the original problem somewhere else. As we have introduced new fields, we also get additional 
equations of motion for these fields. A variation of v, e.g., would immediately give ut = 0, which is 
exactly what we had before. But we may still add terms depending on the new fields only. 

What we need is a kinetic term for the new fields, which makes the field u a configuration variable 
and v the conjugate momentum. It is 

L kin = (TrfaT^Mfa)) ~ <f d<f> Tr(uu-%v)). (3.27) 
The notation ^ Q indicates that the range of the integral is </>o to cfto + 2tt. 

Exercise 3.9. Show that the value of L^ n does not depend on cpQ. The integrand itself is not periodic, 
but the first term compensates this non-periodicity. 

At the moment there is absolutely no motivation for this awkward looking boundary term. The only 
reason to choose this is that now we get the correct equations of motion. It is straightforward but 
rather complicated to compute these equations, as we are dealing with group valued quasiperiodic 
fields. Only those variation are allowed which preserve ( 3.22j ). The actual derivation of the equations 



is given in the appendix. Requiring the action to be stationary implies 

u = uut, v = ujtu 1 = i>£ = 0. (3.28) 



Hence, ( 3.25| ) and ( p. 28 ) provide the full set of evolution equations for the fields. Finally, we can 



check that these are consistent with the constraints in the following sense: given any solution to the 
constraints as initial condition for the time evolution equations, then the constraints are automatically 
satisfied at any later time. In Dirac's formalism such constraints are called first class, and we will 
see below that this is equivalent to the fact that the Poisson algebra of the constraints is closed. 
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Exercise 3.10. Show that under the time evolution the relations ( 3.22| ) and all the constraint equa- 
tions ( 3. 15j ) and (3.26) are conserved for arbitrary values of the multipliers ujt,~ft- As a consequence, 
those cannot carry any physical information, as their time dependence is completely undetermined. 
All physical information is contained in the phase space variables {ujj, 7j, u, v}. 



3.2 Poisson brackets, gauge symmetries and constraint algebra 

We now have a well defined Lagrangian depending on the multipliers u%, -ft, the configurations 
variables u>j, u, and their conjugate momentum variables -ji and v. The Lagrangian is first order in 
time derivatives, so it should be straightforward to read off the Poisson brackets. For the interior 
of J\f it is in fact easy, as the canonically conjugate momentum of Ui is £ l3n /j. To avoid confusion 
with the matrix indices, we will not use brackets where both entries are matrices. Instead, we use 
"dummy" sl(2) matrices and take traces to write down the brackets. The basic bracket between the 
spin connection and the dreibein reads 

{Tir(a7 i (y)),w i (ar)} = e tj aS(x,y), (3.29) 

or vice versa 



{Tr(owi(y)),7,-(a;)} =e ii aS{x,y), (3.30) 

and brackets between components of the connection or the dreibein alone always vanish. Instead of 
tracing the momenta, we can also switch back to the vector representation for 51(2) matrices (by 
taking a = ^7 a ), where the same relation reads 

{ei a (y),u>3(x)} = \e l3l a 5(x,y), (3.31) 

So far this was not too sophisticated. However, if we now consider the boundary term, things get 
more involved. Not only that the fields are group valued, the quasiperiodicity makes it hard to find 
the brackets. It is not easy to derive the brackets of u with v in the usual way, and those of v with 
itself are even worse. It turns out that there is only a subset of brackets that is actually needed, 
namely those where at least one entry is one of the periodic fields and 7^ introduced in ( 3.23| ), 



The rather technical derivation of the complete brackets is given in the appendix. All it's commute, 
and thus also the bracket of u with vanishes. We have 

{u(</>), TY(a^(0'))} = O, 

{v(cft), TV(au>^'))} = -u{(ft) a u' 1 (eft) 5 (eft - (ft'), (3-32) 

where 5((ft) is the periodic delta function on the circle, i.e. it has a peak whenever (ft = 2nz, zeZ. 
The brackets with 7</> are 

{v(4),Tr(a%W))} = 0, 

{u(cft), Tr(a%(cft'))} = -u{<ft) a 5(<j> - 0')- (3.33) 

Note that they are periodic in (ft' but quasiperiodic in eft. In particular, the quasiperiodicity parameters 
ui and V£ commute with both and 7^- Differentiating them with respect to (ft and using the 
definition ( 3.23j ) we obtain the following brackets between and 7^: 



{ Tr(a^(0)), Tr(67 (<//))} = Tr([a, b]^) 5{cft - eft') - Tr(ab) 5' ((ft - (ft'), 

{ Tt(a%((ft)), Tr(b7^'))} = Tr([a, b]%) 5(cft - (ft'). (3.34) 
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We can now use these brackets to reproduce the equations of motion. We first define the "smeared" 
constraints. Given any two sl(2) valued fields A and n on jV, they are 

L[X] = J d 2 x e li Tr(AA7j) + E / d< ^ Tr ( A (^ - 7*)) ~ °> 

H[n] = J d 2 x \e ij Tir(»%) + E / d< ^ Tr H<^ ~ ^)) ~ °- ( 3 - 35 ) 

The constraint equations tell us that these quantities have to vanish for any pair of fields A, n. 
Remember that the integration runs over the compactified manifold jV, thus A and n have to be 
smooth at the boundary and the integrals are always well defined. Using this notation, we can write 
down the Hamiltonian as 

H = -L[ut] - H[~f t ] « 0. (3.36) 

Thus H is a linear combination of constraints, the coefficients given by the Lagrange multipliers. It 
vanishes for any solution of the constraints, i.e. on the whole physical phase space. This reflects the 
fact that our "time" t is unphysical. As a consequence, there is no "energy" of a physical state which 
is usually represented by the Hamiltonian defined with respect to the physical time. However, the 
Hamiltonian still generates the time evolution, but with respect to the parameter time t, by taking 
Poisson brackets. To compute these brackets, it is more suitable to perform a partial integration in 
( [3.35 ), such that there are no more derivatives acting on cjj or 7j. Using Stokes' theorem we find 



L[X] = - Jd 2 x e« Tr(AA7i) + E / d< ^ 

H[n] = - Jd 2 x e ij Tr(d;n w,- - nu^j) + ^ I dcp Tr(no; ). (3.37) 



With the brackets above we obtain 



{L[\},u i } = D i \, {L[\],u} = u\, 

{L[\],j i } = [j i ,\}, {L[X],v} = 0. (3.38) 



and 



{H[n],cj i } = 0, {H[n],u} = 0, 

{H[n],fi} = Di-n, {H[n],v} = unu 1 . (3.39) 

There are two important facts summarized in these equations. First, observe that the right hand 
sides are exactly the Lorentz transformations (|3.6[) and translations ( |3.7|) , but now they act only on 
the canonical fields on M instead of the fields on the spacetime manifold M. In addition, we get some 
new transformations of the extra boundary fields, and one can check that these are indeed necessary 
for the complete action including the boundary terms to be invariant under gauge transformations. 
Hence, we found an explicit representation of the operators introduced in ( |3.6| ) and ( p.7[ ) as Poisson 
brackets with the constraints. We therefore call L the Lorentz constraint, and H is called Hamilton 
constraint: in ADM gravity using the metric as field variable, it splits into the generator of space 
and time diffeomorphism, and the name Hamilton constraint was introduced because it somehow 
replaces the Hamiltonian as the generator of time evolution. 

Secondly, we see that the equations of motions are reproduced by the Poisson brackets as the 
Hamiltonian is a linear combination of L and H, and we only have to replace A and n with Ut and 
7t to get (|3.25|) and ( |3.28 ) back. For any phase space function we have 



F = {F, H} = {L[uh], F} + {H[j t ],F}. (3.40) 
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The arbitrariness of the multipliers reflects the fact that time evolution is unique only up to gauge 
transformations. In addition, as our "time" t itself is completely arbitrary, time evolution is in 
fact nothing but a gauge transformation. The action is invariant under any reparameterization of 
time, which is part of the gauge symmetry, so time evolution is formally equivalent to a gauge 
transformation. 

We can now also check that the constraints are first class in the sense of Dirac, i.e. that the 
Poisson algebra of L and H closes. We already did two similar calculations: we have seen that the 
"linear operators" H and L defined in ( |3.6| ) and ( |3.7| ) do form a closes algebra, and we saw that the 
constraints are preserved by the time evolution. 

Exercise 3.11. Compute the Poisson algebra of the constraints and show that 

{L[\ 1 ],L[\ 2 ]} = L[[\ 1 ,\ 2 ]}, 
{L[X],H[n]} = H[[X,n]], {H[m], H[n 2 ]} = 0. (3.41) 

You have to be careful with partial integrations and boundary terms to get the correct results. As a 
consequence, given any two (smooth) phase space functions vanishing on the physical phase space, 
F « 0, G 0, then we also have {F, G} 0. 

We can now give an exact definition of what we mean by a gauge transformation, which in Dirac's § 
canonical framework is a transformation generated by taking Poisson brackets with first class con- 
straints. 

Definition 3.1 (Gauge Transformation). Two physical field configurations <f>o and (f>\ (denoting the 
set {ui, 7i, u, v}) are called "gauge equivalent" or "related by a gauge transformations" if there is a 
smooth deformation S , s G [0,1], such that 

-^ a «{L[A a ]+H[n,],0,}, (3.42) 
as 

with some parameter fields X s ,n s , which may of course vary with s. 

Field configurations related by a gauge transformation are assumed to represent the same physical 
state. All other symmetries of the action are called global symmetries and they map different phys- 
ical states onto each other. We will see this quite explicitly for an example below, so it is really 
important to distinguish between these symmetries. In context of gravity those global symmetries 
are sometimes called "large diffeomorphisms" or "large Lorentz rotations", because they cannot be 
cut into (infinitesimally) small pieces. 

To identify the gauge invariant objects we will need later on, we have to know how the finite 



gauge transformations described in definition 3.1 look like, i.e. we have to go over from the gauge 



algebra to the gauge group. We formally get it by exponentiating the "infinitesimal generators" L 
and H. This is quiet easy for the translations as the exponential series is of first order only. A finite 
translation is given by 



U>i I— > tJj, u^u, 

-1 



7i ► li + Din, v i — > v -\- unu . (3.43) 

For the Lorentz symmetry exponentiation is not as simple, but of course we know how to do it. 
Instead of an sl(2) valued field A the parameter becomes a group valued SL(2) field h and the 
Lorentz rotation reads 

<*V, i-> h" 1 (d fJ/ + u — > uh. 

7 M i ► h^-y^h, v h-> v. (3.44) 
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However, here we have to be a little bit more careful again. In section |L2| we saw that the "Lorentz 
group" is actually SL(2) and not SL(2). So shouldn't the parameter of the Lorentz rotation better 
be an SL(2) field hi Or, the other way around, couldn't we take an S0(l,2) field h a i, as parameter, 
and what's the difference? We can rewrite ( |3.44j ) in the vector representation and get 

Ufiab i-> hPairjeddft + u)^ cd )h d b , e/ h-> h b a ej>. (3.45) 

However, here we already get into trouble with the transformation of u. Ignoring this problem for a 
moment, we could equally well replace the boundary field u G SL(2) with u a b G S0(1, 2)), so that we 
don't have to assume that h% is the projection (1.16) of some field h onto S0(l,2). The symmetry 



transformation, at least for the gauge fields Ui and -ft, is well defined for any S0(1, 2) field h%. The 
difference between ( |3.45| ) and ( 3.44} ) is that there are S0(1, 2) valued fields which are not projections 
of SL(2) valued fields. 

Exercise 3.12. Show that there are such fields on a manifold J\f if and only if the loop group of J\f is 
non-trivial, i.e. T ^ {rj}. 

The question is: shall we consider those as gauge transformations or not? They are certainly sym- 
metries of the action. However, the definition above tells us that they are not gauge symmetries in 
the sense of Dirac. 

Exercise 3.13. Show that fl3.43| ) and ( |3.44| ) are gauge transformations if and only if h is a projection 
of some SL(2) valued field h on Af (see exercise |1.8| ). As an example, take the cylinder manifold with 



the fields given by (2.54). With our boundary conditions, these are in fact the 7^ fields. Choose 
Too = r 7i to transform to the finite fields. Then perform a Lorentz transformation with parameter 
h(r,4>) = exp((/>7o) (which is quite simple). Transform back to the physical fields y». You will end 
up with a different value of £, showing that physics has changed by choosing a field h as parameter 
for the Lorentz transformation which is not a projection of any h. 

Exercise 3.14- Under the finite gauge transformations, the transport operator and the distance of a 
path changes by 

U a 1 — * h~\a(0)) U a h(a(l)), V a >-> h~\a(0)) V a h(a(0)) (3.46) 
for a Lorentz rotation with paramter h : A/" — * SL(2), and 

U a h-» U a , V a \->V a + U a n(a(l)) U^ 1 - n(a(0)), (3.47) 
for a translation with parameter n : M — > sl(2), respectively. 

3.3 Solving the constraints 

To find classical solutions of the equations of motion, all we have to do is to solve the constraints. 
Time evolution of the so found states is then given by choosing suitable values for the multipliers and 
computing the brackets with the Hamiltonian. A "suitable" choice would be u>t = and yt = 70, 



corresponding to free falling local observers (see exercise 2.9). A second step will then be to classify 
the equivalence classes of the solutions modulo gauge transformations, which gives an overview over 
all physically inequivalent classical states. 

Let us first consider the interior equations ( |3.15| ). The Hamilton constraint depends on cjj only 
and states that its field strength vanishes. The solution for such an equation is well known. It can 
be locally parameterized by an SL(2) valued field g and reads 

= Q^dig. (3.48) 
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However, such a parameterization exists locally only and getting a parameterization for the global 
solution is not trivial. Before showing how to solve this problem, let us solve the Lorentz constraint 



locally, too. Inserting ( 3.48 ) we can rewrite it as 

e lj g- 1 d l (g lj g~ 1 )g = 0. 



(3.49) 



It now states that the curl of the covector inside the parenthesis vanishes and therefore there is 
(locally) a "potential" / G s\{2) such that 



gag 



-i 



dif 



ti- 



ll 



Mfg. 



(3.50) 



How can we find the global solution? The answer is that we cannot define the "potentials" g and / 
globally on jV, but we can do it on the covering manifold J\f . In addition, we already learned that 
it might be better to take an SL(2) field g instead of g G SL(2) right from the beginning. This will 
avoid confusion later on, as the gauge invariants are in fact SL(2) valued. 

Remember that the elements of M are homotopy classes of paths in N starting at a special point 
x. Hence, let us consider this point as a special observer and what follows will describe the universe 
as seen by this observer. Given a solution to the constraint equations ( 3.15| ), we will use the non-local 



objects from section 2.3 to define the following fields on M: 



g(a) = U a , g(a) = U a , f(a) = V a . 



(3.51) 



Of course, g is just the projection of g onto SL(2), so it is not really an independent field, we just 
need it to derive some explicit formula. Remember that the transport operator and the distance of 
a path are invariant under smooth deformations of the path, provided that the dreibein postulate 
holds and the curvature of the spin connection vanishes. Hence, these fields are well defined on M 
if the constraints are satisfied. If we now vary a G N without keeping a(l) fixed, we can use the 



spinor representation of (2.27) to get 



SU a = U a (*M Sa l SV a = U a 7, 5a % U a 



(3.52) 



where 5a l is the variation of the end point of a. Using the same coordinates on J\f and J\f, Sa l is 
also the variation of the point a G M, so these two equations tell us how the derivatives of g and / 
look like. Explicitly, we get 



Mg 



Mfg 



a- 



(3.53) 



Note, however, that these equations "live" on M, and Ui and ~fi are the pullbacks of the gauge 
fields onto N . As and 7« are periodic, we are using the same symbol for a field on TV and the 
corresponding periodic field on J\f . 

Obviously, we can recover 7, and uJi from g and /, and we get exactly ( |3,48| ) and ( 3.50| ). Thus 
the potentials g and / parameterize the solutions ( 3.15j ) completely. But not every arbitrary set of 
potentials on J\f corresponds to such a solution on J\f: the left hand sides of ( p. 53 ) have to be periodic 
on J\f . We already dealt with these kinds of conditions for the u and v fields above, so we expect the 
potentials to be quasiperiodic. Indeed, this time the condition is even stronger: 



ISO(l,2) defined by ( 3.51 ) is normalized quasiperi- 



Exercise 3.15. Show that the field (g, f) : J\f 
odic, i.e. we have 

{g,f)°h = (g\,f\){g,f) 

for all A G T, with g\ = g(X) and f\ = /(A). In components: 

g{\*a) =g\g{a), /(A • a) = f x + g\ f{a) gx 1 . 



(3.54) 



(3.55) 
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Every solution to the constraints is mapped onto a pair of normalized quasiperiodic potentials. We 
already saw that this map is injective, as we were able to recover the fields from the potentials. On 
the other hand, if (g, f) is a set of normalized quasiperiodic potentials, we find that the gauge fields 
derived via ( 3.53; ) are periodic on jV, and of course they solve the constraints. Hence, there is an 



inverse map from the set of normalized quasiperiodic potentials to the solutions of the constraints. 
This inverse map is injective, too: if there are two potentials yielding the same gauge fields via ( |3.53| ). 
then they can differ by a constant only: g' = g g, f = fo+gofgo 1 , or (<f , /') = (g , f )(g, /). But 
normalized quasiperiodic fields always obey g{rf) = 1, f(rj) = 0. This gives go = 1, fo = 0. So the 
map is in fact one-to-one. 

Now what about the boundary fields u and vl We have to solve the boundary constraints ( |3.26| ), 
which relates the gauge fields on the boundary to the quasiperiodic fields u and v. To insert the 
parameterization of the gauge fields in terms of the potentials on jV, we use the boundary paths 
introduced in figure |l[ We get the gauge fields and 7^ tangent to the boundary by fixing any 
family of paths a^A, and using the formulas ( |3,4Sj ) and ( |3.50D . Hence, the boundary constraints are 

g' 1 d r f > g(ae >( j ) ) =vT 1 d <t> u(4>), 
g~ 1 d,pf g(ae :( p) =u~ 1 d (j) vu(<p). (3.56) 

The solution to these equations is that (u, v) differs from (g, f) only by multiplication with a constant 
(b£,Cp)~ 1 from the left, or explicitly 

u{4>) = bj x g(ai^), v{4>) = bj 1 (f (a^) - c e ) b e , (3.57) 

where bg £ SL(2) and eg £ sl(2). These two variables (for each end of AT) represent the additional 
physical degrees of freedom introduced with the boundary fields u and v. They depend on the special 
family of paths chosen in ( |3.57 ), so we always have to fix this family before introducing bi and eg. 



Nevertheless we now have a complete parametrization of the solutions to the contraint equations, 
and we can summerize the result as follows. 

Theorem 3.1. There is a one-to-one map from the set of physical field configurations on M onto the 
set of normalized quasiperiodic functions N — * ISO(l, 2): 

{fa%(x),7i(a;),ii(0) } «(0)} < — ► {g(a),f(a),b e ,c e }. (3.58) 

It is explicitly given by 

g{a) = U a , b £ = U at ^u~ 1 {<t>), 

f(a) = V a , q = V atA - U a ^vT\4>) v(<j>) u{4>) U-\, (3.59) 

where ai^ is a fixed family of boundary paths as defined in figure 0. The right hand sides are 
independent of <p. The inverse map reads 

Ui(x) =g~ 1 d i g(a x ), u{4>) = bi 1 g(a^ <t> ), 

li{x)=g~ 1 d i f g{a x ), v{4>) = bi 1 (f(a^ <f> ) - c e )b e , (3.60) 
where a x £ M is any path with n(a x ) = x £ J\f. 

This theorem tells us what the solutions to the constraints are, or how the physical phase space looks 
like. It does not tell us whether two solutions are related by a gauge transformation and represent 
the same physical state. What we are looking for is the so called reduced phase space. 

Definition 3.2 (Reduced Phase Space). The set of physically inequivalent states, i.e. the set of equiv- 
alence classes of solutions of the constraint equations modulo gauge transformations, is called the 
reduced phase space. A physical state is uniquely given as a point in the this space. 
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Hence, the reduced phase space gives a complete classification of the possible physical states of a 
system, and what we have to do is to find the equivalence classes of the sets {g, f ,bg, eg} under 
Lorentz rotations und translations, which were given in ( 3.43| ) and ( 3.44[) . Remember that though h 
appears in the finite Lorentz rotation, the parameter is actually an SL(2) field h on M. We can use 
( |3.47 ) and what we get is 

g i ^ h^gh, b t i-> ho L b e , 

f^ho 1 fh , ce^h^ceho, (3.61) 



for the Lorentz rotation and 



/>->•/- no + gng , Qhq - n , (3.62) 



for the translation, where ho = h(x) and no = n(x). To simplify the notation, we can combine both 
gauge transformations into a single IS0(1,2) transformation 

(9, f) (^o, n )~ l (g, /) (h, n), 
(be,Ci) h-> (/i ,n-o) _1 (^,C£)- (3.63) 

Again, we are using the convention that the same symbol denotes a field on N and the corresponding 
periodic field on N . Hence, instead of using fields on N we can use periodic fields h and n on N as 
parameters of finite gauge transformations. We can now proof a second theorem which gives us part 
of the classification of gauge invariant parameters of solutions: 

Theorem 3.2. Two configurations {g, f, bi, cf\ and {g' , /', b' e , c^} are related by a gauge transforma- 
tion if 

9\ = 9x, fx = fx, bit = b' e , a = 4, (3.64) 

for all A G T and all ends £, i.e. if the potentials coincide on the loop group and the boundary 
parameters are equal. 

The proof is quite simple. Define (h,n) = (g, /) _1 (g', /'). This is periodic, because 

(h,n)ob x = {g,f)-\gxJx)- l {gxJ'x) (g'f) = {gJT\g'f) = (h,n), (3.65) 

and we also have (ho, no) = (1, 0). Thus (h, n) can be used as a parameter for a gauge transformation 
( p. 63 ), which transforms (g, f) into (g',f). 

As a result, we find that an equivalence class of solutions is entirely determined by the discrete 
set of variables {g\, f\,be,cg}. We call them the "holonomies" . They are still infinitely many, but 
they are not independent. From the quasiperiodicity conditions ( 3.54 ) we infer that they have to 
provide a representation of the loop group: 

[gx.K, fx.n) = (gx, fx)(g K , /«)■ (3.66) 

It is obviously sufficient to specify the values of g\ and f\ for the primitive loops introduced in 
definition L6, all others are then given by linking them and using ( 3.66j ). So there are only finitely 
many parameters left to classify the physical states, showing that the reduced phase space of three 
dimensional gravity on a reasonably well-behaving manifold is finite dimensional. But we still don't 
have a full classification of physical states. First of all, even the finite set of "primitive holonomies" 
might not be independent, as there may be relations between the primitive loops which have to be 
respected by the holonomies. A standard example is the torus, where the relation A»k»A _1 = r] 
leads to the restriction 

(gx, fx)(g K , f K )(gx, fx)~\g K , UT 1 = (i,o) (3.67) 
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for the primitive holonomies, which becomes rather complicated when written out in components. 
More complicated relations hold for higher genus manifolds, and it is getting even difficult to find 
the range of the primitive holonomies. 

In addition, different holonomies not necessarily correspond to different physical states, as there 
are still gauge transformation acting on them. They are those gauge transformation which did not 



occur in the proof of theorem 3.2, i.e. for which the parameters ho and Uq at x do not vanish. 
Evaluating ( |3.63 ) for the holonomies gives 

(9\,fx) >-> {ho,n Q )~ x (gx, f\){h ,n Q ), 

(b e ,c e ) (/i ,n ) _1 (^,c £ ). (3.68) 

Hence, for the holonomies corresponding to the loops the gauge transformations are inner automor- 
phisms of IS0(1,2). Form definition 1.13| we know that we can use an IS0(1,2) paramter (ho, no) 



instead of the ISO(l,2) parameter (ho, no) appearing in ( |3,63j ), as inner automorphisms of the cov- 
ering group are completely parameterized by elements of IS0(1,2). 

The boundary holonomies transform by multiplication from the left only, and therefore it is quite 
easy to find the equivalence classes under gauge transformations, provided that we have at least one 
end. Calling it £ = o, then the quantities 

(b 0> i, c 0j i) = (b , Co)" 1 (b e , c e ) for £ / o, 

(9o,£, fo,e) = (b Q , c ) _1 (5a, fx) (b Q , c Q ) for A G T , (3.69) 

are gauge invariant. Note that again we are multiplying SL(2) elements with SL(2) elements in the 
second line, but b Q acts as an inner automorphism so that the operation is well defined. These 



variables provide a full classification of the classical solution: by exercise [L3I we know that the loop 
group of a manifold with boundary is free, so the reduced phase space is just a direct sum of "number 
of primitive loops" times IS0(1,2) plus "number of ends minus one" times IS0(1,2). If there are no 
"ends" of the space manifold J\f, we cannot use the boundary holonomies to obtain jjauge invariant 
quantities. In this case, one really has to find the equivalence classes of sets of IS0(2) elements 
(obeying extra relations) modulo inner automorphism. 

Finally, let us consider the time evolution of the holonomies. As we know how they transform 
under gauge symmetries, we only have to replace the parameters by the time components of the 
gauge fields u t and 7t to get the time evolution. We find 

9\ = [g\,u t (x)}, b e = -u t (x)b e , 

f\ = [f\,u t (x)]-7t(x)+g\lt(x)g\ 1 , ce = [c£,utt(x)] — 7t(x). (3.70) 

We see that they only depend on the value of the multipliers at x, because we eliminated all gauge 
degrees of freedom except those at this special point. In the first equation we gave the time evolution 
for g\ instead of g\, which cannot be written as a simple commutator, as it is not a matrix. However, 
the time evolution acts as a Lorentz rotation only, so it is obvious how the time derivative of g\ looks 
like. We can also integrate the time evolution by using the "evolution operators" ( 2.44| ). We then 



get formally the same as ( 3.68 ) for the gauge transformations, and here we can insert g\ again: 



(gx,fx)(t2) = (U s ,V s )(t 2 ,h) (g x ,fx)(h) (U s ,V s )(t u t 2 ), 
(b e , Ci )(t 2 ) = (U^,V^)(t 2 ,h) (b^ceXh). (3.71) 

For free falling observers the time derivatives become very simple, as ut = and jt = 7o : 

g x = b £ = 0, fx = gxlogx 1 - 7o, Q = -70. (3.72) 
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3.4 Cylinder spacetimes 

Before going over to quantum physics, in this last "classical" section we will use the formalism derived 
above to classify the solutions of Einstein's equations on a given manifold. We will not need the results 
of this section later on. All we want to do here is to construct some interesting spacetimes, showing 
that there is a huge variety of inequivalent solutions to Einstein's equations even on the simplest 
possible manifold. Explicit solutions to Einstein's equations in three dimensions were classified in 
[p4|l, and of course the solutions presented here are closely related those given there. For a similar 
approach using dreibein formalism see also [25, 26] for torus and higher genus manifolds, and recently 



the Klein bottle has been studied [27], too. Our picture of spacetime as a set of observers evolving in 
time is a very useful tool to visualize its structure and the various singularities. We take the set of 
local observers to be the cylinder J\f = R x S . For simplicity, we drop all the boundary terms here. 
The boundary terms were constructed such that their existence does not affect on the gauge fields 

and 7j. They are easily restored by choosing arbitrary values for and and using ( |3.60| ). We 
will discuss their physical interpretation in detail in the quantum section. 

The coordinates on N are r, <f> with <p = <fi + 2ir, and we have the same coordinates without 
periodicity relation on M ~ E?. For simplicity, we choose x = (0, 0), so that (r, 4>) G N is represented 
by the path s i— ► (sr,s(f>) G N. All the fields have trivial limits for r — > ±oo, so there is no need 
to introduce the p coordinate from section 13.11. Nevertheless we need the field which relates the 



finite fields 7^ to the physical fields 7^ (see ( 0.12 )), in order to get the correct behaviour of the 



physical fields at infinity. We simply choose 700 = r7i, which obviously diverges to spacelike infinity 
1^(700700) — ► +00 for r — ► ±00. In addition, we "gauge fix" all our observers to be free falling, i.e. 
we always have u t = and jt = It = 7o- 

With the results of the previous section, we know that the solutions are parameterized by a single 
pair (g\, fx), where A = (0, 2tt) G M is the primitive loop of the cylinder. Two such pairs represent 
the same physical state, if they are related by an inner automorphism, which explicitly reads 

9x i-> ho X g x h , f x i-> ho" 1 (/a + gxnoQx 1 - n ) h . (3.73) 

Using the classification of SL(2) and sl(2) elements, we can split the states into three classes, de- 
pending on the value of the parameters: if g\ is non-null, then the state is called timelike, lightlike 
or spacelike depending on the value of g\, otherwise depending on the value of f\. Note that if g\ 
is null, then g\ = ±1, and fx does not transform under translations, so that the squared length of 
fx is gauge invariant. The special cases gx = ±l z ,fx = are included in the timelike states. An 
important property of translations can be used to simplify the complete classification of inequivalent 
states: 

Exercise 3.16. Assume that g\ is non-null, i.e. gx = g(v a ,±) for some vector v a 7^ in figure |2[ 
Then for any vector (or matrix) w a ~f a = w orthogonal to v a (v a w a = or Tr (g\w) = 0), and only 
for those, we can find a translation that adds w to fx- 

Note that the vector added to fx by a translation is the difference between no and the Lorentz 
transformed "vector" gx n og\ l ■ What the exercise says is that every vector that is orthogonal to the 
"axis" of this Lorentz rotation can be written as such a difference. This is quite obvious in Euclidean 
three dimensional space, but holds in Minkowski space as well. 

Let us start with the timelike states: then either g\ is non-null and lies in the timelike regions of 
figure ||, or fx is timelike. It is obvious, that there is always a Lorentz rotation that maps a timelike 
vector onto the vertical axis, so in the first case we can assume that gx G SO (2) (which is the vertical 
axis in figure 0), in the second case we can assume that fx lies on the 70 axis. If gx is non-null, then 



we still have the freedom of making a translation acting on f\. From exercise 3.16 we know that we 



can add any matrix lying in the 71 2 plane to fx, so that finally both parameters lie on the 70 axis: 

gx = exp(Cvr7o), / A = 2vrZ7 . (3.74) 
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Remember that the exponential of the 7 "matrices" was introduced as a formal notation for the 



exponential map sl(2) — » SL(2) in section 1.2. The timelike states are therefore completely param- 



eterized by two real numbers £, I. There is no time evolution of the holonomies, because the right 



hand sides of (3.72) vanish. Hence, timelike states are static. To find a field configuration 0^,7^ 
leading to these holonomies, we first have to find a pair of quasiperiodic fields (g, f) on M. This is 
not difficult, we take 

g(r, <t>) = exp(±C</>7o), f(r, (/>) = /</>7o (3.75) 
Using ( 3.60 ) we compute the gauge fields 

^ = 5C7o, u: r = 0, 7 = ^7 O , 7 r = 0. (3.76) 



Transforming to the physical dreibein field by ( 3.12| ) gives 



It = 7o, 7r = 7i, % = C^72 + 1 7o (3.77) 



Obviously, for I = we get the "cones" considered in section 2.4. There we found that the "central 
cylinder" (t, r = 0, (f>) actually represents a line in spacetime only (the worldline of the peak of the 
two cones glued together at r = 0), as points with different 4> but the same t can be connected by a 
path of zero distance dx a = everywhere. For / 7^ this singularity is different. Writing the distance 
as an s[(2) matrix we get 

da; a 7 a = (dt + /d0)7o + dr 7l + Crd0 72 . (3.78) 

This vanishes if r = 0, dr = and dt = —Idcj). Hence, there is still a singularity at r = 0, but this 
time the events to be identified lie on a helix winding around the central cylinder, rather than being 
the circles with constant t. The path specified by t + /</> = to, r = represent one spacetime event 
only. But to and to + 27r/Z yield the same path, so the set of these lines, the "singularity", forms a 
circle. However, it is still timelike, and we can measure its geodesic distance, which is 2itI~yq. Hence, 
the singularity is a timelike circle of "radius" |/|, and the sign of I tells us in which direction time 
runs around the circle. 

As the singularity is a timelike closed path, one can expect that there are such paths outside the 
singularity as well. In fact, the circles at constant r and t become timelike for r < For large r, 
the £r term in dx a dominates the d<p part, so far away from the singularity the universe looks like a 
cone. The metric itself is obtained by taking the square of the "distance" above: 

ds 2 = -(dt + I d(/)) 2 + dr 2 + (Cr) 2 d^. 2 . (3.79) 

Outside r = 0, this describes the gravitational field of a spinning particle at the origin. Approaching 
the particle, the lightcones become more and more tilted into the angular direction in which the 
particles rotates. There is also a corresponding four dimensional spacetime (and this also holds for 
all the examples below), which can be obtained by adding dz 2 to the metric, where z is an additional 
global coordinate. The timelike states then represent spinning cosmic strings. 

The spacelike case is somehow dual to the timelike, and can be obtained by exchanging the r and 
t direction, and 71 with 70. However, there are additional spacelike states leading to very peculiar 
spacetimes, which have no timelike counterpart. This is due to the fact that timelike elements of SL(2) 
can always be written as exponentials whereas spacelike cannot. However, by the same arguments 
as above we can always perform a gauge transformation such that cj\ and fx lie on the 71 axis. Use 
the coordinates ( [L.21 ), the parameters become two real numbers and an additional integer z 6 Z: 



ff A = l 2 exp(Cvr7 1 ), / A = 2^ 7l . (3.80) 
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These holonomies are not time-independent: g\ is still constant by ( 3,72 ), but fx has a constant 
velocity, so that 

fx(t) = 2vrZ 71 + t (gxlogx 1 - To). (3.81) 

We see that the state is no longer static, as it was in the timelike case. We can calculate the time 
dependence of fx explicitly using the formula 

exp(— ^071)70 exp(^<27i) = cosh a 70 + sinh a 72 . (3.82) 

This gives 

fx(t) = 2vrZ 71 + t (cosh(27rC) - 1) 7o + t sinh(27rC) 72 . (3.83) 

Remember that fx measures the geodesic distance of the circumference of the cylinder at the base 
point x. We can see that for £ 7^ it grows linearly with t and calculating its length squared, i.e 
i Tr(fxfx), shows that it is always spacelike (except for ( = I = 0). So this time we expect to get a 
non-static solution consisting of a growing spacelike cylinder. To find quasiperiodic potentials with 
the given holonomies is not so easy: for z 7^ 0, gx is not an exponential of any element of the algebra, 
so there is no analogy of ( |3.75| ). Let us consider the case z = separately first. Then we can define 

g(r, <j>) =exp(vrC7i), 

f(r, <j>)=lcf>y 1 + t (g(r, <j>) 7o g-\r, 4>) - 7o)- (3-84) 

The field f(4>,r) has become time-dependent as well. The computation is now straightforward and 
similar to the timelike case above. The result is that the physical fields are 



1i> It =7o, 7r=7i, % = l~fi + (t~/2- (3.85) 



Here, the singularity is obviously at t = 0, so we have some kind of big bang universe. After that, 
there is an expanding cylinder with growing radius £i and "offset" 27rZ, i.e. if you go once around the 
cylinder orthogonal to the r-direction, you end up 27rZ away from where you started. If I = 0, the 
cylinder degenerates to a line at the big bang, otherwise we find that the singularity is a spacelike 
circle of radius |Z| similar to that for the timelike case above. For I 7^ it is actually the length 
of the cylinder that shrinks to zero at t = and not the radius, so the singularity is in fact the 
circumference. 

Exercise 3. 1 7. Verify this by finding a coordinate transformation such that the new r and (j) directions 
become orthogonal. Then you have 7r — > for I 7^ 0, but 7^ — > for I = at the big bang. 

As there is no mixing of space and time, i.e. the space N is always orthogonal to the time direction 
t, it is quite easy to "see" how the spacelike states look like, using the ADM picture of a space 
evolving in time. This also works for z / 0. The simplest and most illustrative way to construct 
these spacetimes is to cut the cylinder into two strips at <j> = and eft = ir. As a consequence, the 
two parts behave quite differently, and spacetime is no longer symmetric under rotation of the </> 
coordinate. But anyway we should not expect that spacetime has such a symmetry in general. As gx 
is not an exponential for z 7^ 0, the idea is to write it as a product of two exponential 1 2 = exp(7T27o) 
and exp(7rC7i), such that the quasiperiodic field g increases from 1 to 1 2 on the first strip and to gx 
on the second. The resulting field looks a bit complicated, as we have to define it on each strip of N 
(labeled by an integer n) separately: 

gf r a) _ |exp(n7rC7i)exp(((/)-n7r)z7o) for 2vrn < <j> < 2vrn + vr, ^ gg . 

* ' \ exp (((/) — n7r)C7i) exp (n7TZ7o) for 2-7rn — it < 4> < 2irn. 
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For fx we could in principle insert the same expression as in ( 3,84 ), but the metric becomes simpler 
if we take / to be constant on the first strip and increasing only on the second one, but of course 
twice as fast: 



f(r, 



t(gjog 



-1 



7o) + { 



2nirl 71 



for 2im < (j) < 2im, + 7r, 



mr)l 71 for 2im — ir < (j) < 2irn. 



(3.87) 



Exercise 3.18. Show that g and / are continuous, and that they are normalized quasiperiodic with 
parameters g\ and fx given by ( 3.80 ) and (|3.8l| ). 



The 4> components of the gauge fields become 

- J Z70 ' J forO<0<vr, 

^-\C7i, 7 *-\2/ 7l -2Ct72 for7r<0<2vr. (3 - 88) 

They are of course periodic, but not continuous because the potentials are not differentiable at 
(j) = nir. However, in principle g and / can be deformed slightly in the neighbourhood without 
destroying the quasiperiodicity, so that the gauge fields become continuous. The components of the 
fields tangent to the "border" <j) = nir, i.e. the r components, are continuous, so all the non-local 
objects like transport operators and lengths of path are well defined anyway. The physical fields are 

/ 22x72 for < 4> < ir, 

It = 70, Ir = 71, ^ = { 2lli _ 2Qtl2 for7r<0<27r . ( 3 - 89 ) 

To read off the geometry, it is again more convenient to write down the infinitesimal distance. This 
contains the same information as the metric, which is just the sum of the squares of the factors in 
from of the gamma matrices (with a minus sign for 70). 

1 a _ fdt7o + dr7i + 2zrd^72 for < < vr, 

la \ dtj + (dr + 21 d0)7i - 2(t d^7 2 for vr < < 2vr . ^' JUj 

Now it is not very difficult to see how this space looks like. For z = ±1, the first part consists of 
two copies of flat Minkowski space in polar coordinates glued together in the origin as described in 
section |2.4j , but there is a cut along a radial line, and (j) runs from to 7r only (therefore the factor 
2 in 7^). For larger z, instead of a simple Minkowski space, we get a screw-shaped space winding 
around the origin z times when <f> increases from to ir. The second part consists of a parallelogram 
shaped strip of width 2nCt and "offset" 2nl, which is glued to the first part along the cuts. The 
width of the strip grows linearly with t and vanishes for t = 0. 

Exercise 3.19. Find the equivalence classes of observers connected by paths with vanishing distance, 
which by definition represent a single physical point in space only. Show that for / 7^ there is no 
essential singularity, i.e. all nontrivial such equivalence classes of points in are contractible objects. 
This was not so for the spacetimes considered before, where the paths with vanishing distance were 
circles or extended to infinity. Here you can also pass from one "end" to the other without going 
through the singularity, by passing over the strip at r < irl. So actually we have a wormhole spacetime 
rather that a conical singularity. If / = 0, but C 7^ 0, then there is only one essential singular event 
at t = 0, r = 0, arbitrary, which forms a circle of observers and cannot be contracted. 

So the spacelike states are already rather strange. You can try to find other coordinates that make 
them "smoother", avoiding the rather artificial piecewise construction. Another possibility for the 
quasiperiodic field g is to use the global coordinates (w, x, y) on SL(2) introduced in exercise |1.8| . This 
leads to a smooth but very awkward looking metric including many trigonometric and hyperbolic 
functions, and it is not easy to visualize the resulting spacetime. 
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What remains are the lightlike states. They are again completely different. As in the spacelike 
case there is an additional integer z labeling the part of SL(2) in which the holonomy g\ lies. We 
only consider the case z = here. For non- vanishing z you can again cut the cylinder into pieces 
exactly as we did in the spacelike case, with an even more peculiar result. Let us also assume that g\ 
lies on the upper lightcone, otherwise we get the same type of solutions with eft i— > —(ft. By a suitable 
Lorentz rotation, we can map any point on the lightcone onto any other, so we may take 

9x = exp(vr(7o + 7i))- (3-91) 

The corresponding quasiperiodic field is 

g(r,cft) =exp(i0(7o + 7i)) g(r,#) = 1 + ±<£(7o +71), (3-92) 

where the last equation follows because the matrix 70 + 71 is nilpotent. For fx we make the general 
ansatz 

fx = 2vrZ (70 - 71) + 2vra (70 + 7i) + 2^672, (3.93) 

with real numbers l,a,b. By exercise |3.16| we know that we can add any matrix to fx which is 
orthogonal to 70 + 7i- This time, however, we can not arrange fx to be proportional to this matrix, 
because it is orthogonal to itself. Instead, we can add anything to a and b, which also become time 
dependent, but we cannot change I, which is the only gauge invariant parameter for the lightlike 
states. The time evolution for fx is 

fx = QxloQx 1 - 7o = 2vr 2 (70 + 71) - 2tt7 2 . (3.94) 

Hence, we have a = tt and b = —1. We will adjust a and b such that the resulting metric becomes as 
simple as possible. Next, we have to find a normalized quasiperiodic field /, and we make an ansatz 
similar to fx' 

f(r,4>) = u{4>) (70+71) +v(4>) (70 - li)+w{4>)l2- (3.95) 

The equation f(r, (ft + 2ir) = gxf(f, 4>)9x l + fx now becomes a functional equation for u, v and w, 
depending on a and 6, and the best thing one can do is to play around with these parameters until 
you get a simple final result (which is rather cumbersome, unless you use an algebraic computer 
program). It turns out that a good choice is 

a = (f tt 2 - 1) I + tt t, b= -2ttI - t. (3.96) 

Solving the functional equations leads to 

u(<ft) = ±lcft 3 + \t<ft 2 - left, v((ft) = left, w{(ft) = -left 2 - tcft. (3.97) 

and a straightforward calculation yields the following gauge fields: 

= 3(70 + 71); 14> = ~ 2l li ~ *72, (3-98) 

Finally, the physical fields are 

It = 7o, 7r = 7i, % = -2/7i + (r -t) 72. (3.99) 
Again, the time direction is orthogonal to the space M, and the infinitesimal distance is 

da; a 7 a = dt 7o + (dr - 2/d0)7i + (r - t)d<ft l2 - (3.100) 
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The best way to describe this spacetime is to call it a cosmic hoover. For I = and fixed t we 



simply have the two copies of flat space known from section 2.4, with a singular point at the origin, 
which is now at r = t. At this point the two copies are glued together, i.e. the circle r = t, (ft 
arbitrary, represents one spacetime event only. But now this structure is not static, as the singularity 
moves with the velocity of light in the direction of r. It is not simply a singularity moving around 
somewhere in the space jV. The situation is much worse: the positive r part is totally absorbed into 
the singularity with the velocity of light, and the observers disappearing into the singularity (where 
they meet all the observers with the same r coordinate) reappear on the other side. 

For I ^ there is no essential singularity any more. Spacetime becomes totally regular as in the 
spacelike case, and observers never meet at a single event. This can be seen as follows. If there is 
any path connecting different observers with vanishing distance, we must have dt = 0, dr = 2ld(ft, 
and (r — t)d(ft = everywhere of this path, which is impossible. Hence, we are only using bad 
coordinates at r = t. Instead of the singularity we get a wormhole connecting the two regions for 
positive and negative r, but still the vacuum cleaner is active, as the wormhole absorbs everything 
with the velocity of light. 

4 Canonical Quantization 

The aim of this section is to carry out Dirac's canonical quantization method Q as proposed by 
Witten |TJ|], starting from the classical phase space spanned by the spatial components of the spin 
connection and the dreibein, as well as the boundary fields. A quantum state is given by a "wave 
functional", which depends on one half of the phase space coordinates, spanning the "configuration 
space". These configuration variables are represented by multiplication operators, i.e. we have to 
choose them such that they commute, and the conjugate momentum operators become functional 
derivative operators. 

The physical state space is then defined to be the kernel of the constraint operators: acting with 
the constraints on physical states gives zero. For this step it is essential that the commutator algebra 
of the quantized constraints closes, i.e. the quantum constraints have to be first class. This does not 
necessarily follow from the fact that the Poisson brackets of the classical constraints are proportional 
to the constraints again. There are mainly to problems arising with Dirac quantization at this 
point. The first is that straightforward translation of the constraints may lead to ill-defined operator 
products: there might appear functional differential operators acting at the same space point. This 
would require a regularization of the quantum operators. However, the great success of dreibein 
formulation of three dimensional gravity is mainly based on the fact the this doesn't happen, i.e. the 
constraints will become perfectly well defined operators without any regularization. From a physical 
point of view, the reason therefor is that the theory is not really a field theory with local degrees 
of freedom, whose quantization normally requires a regularization. In principle, we are dealing with 
"quantum mechanics", our system has finitely many degrees of freedom and we could equally well 
use our classical results and quantize the finite set of holonomies. This is the so called reduced phase 



space method [15|. However, constructing the physical phase space with Dirac's method will turn 
out to be even simpler than classifying the classical degrees of freedom explicitly. 

After the constraints have been translated into their quantized versions, a second problem may 
arise, namely the operator ordering. The classical constraint algebra tells us that the commutator 
of the constraints is proportional to themselves again, but it does not tell us whether the operators 
in the commutator are in the same order as before. In general, there may be different choices for 
the operator ordering leading to different representations for the constraints. There is no reason 
why one of these orderings should lead to an algebra such that taking commutators will preserve this 
ordering. In addition, the structure constants appearing in the constraint algebra may itself be phase 
space functions. Then the quantum algebra must be such that these "structure functions" appear to 
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the left of the constraints. Otherwise the definition of the physical states would not be consistent, 
as acting with a commutator of constraints on a physical state gives zero. But the constraints are 
the only operators that by definition annihilate physical states. If it is not possible to arrange the 
operator ordering in this way, Dirac's method does not work and one has to choose other quantization 
schemes. Again, three dimensional gravity in dreibein formulation does not cause any problems here. 
We already saw that the structure constants of the classical algebra of the constraints ( |3.41| ) are not 
field-dependent. We will also find that there is no operator ordering ambiguity, as all products of 
phase space variables appearing in the constraints are such that the factors commute. 

Hence, all the typical problems occuring in field theory are absent, though we are still dealing 
with an infinite dimensional phase space at the beginning. The reduction to finitely many degrees of 
freedom will automatically come out when we solve the constraint equations. Thereby, we will obtain 
the physical states as wave functions depending on a finite set of variables like the wave functions 
on ordinary quantum mechanics. The final step to get the Hilbert space is then the construction of 
the scalar product. It will be determined by requiring the operators corresponding to real classical 
quantities to become hermitian. 



4.1 Quantum operators and constraints 

The first step in Dirac's programme is to define an operator representation for the phase space 
variables, i.e. the spatial components of the gauge fields cjj,7, and the boundary fields u,v. The 
most reasonable choice is the "connection representation" , where the wave functional depends on cjj 
and u. This is because we cannot choose v to be a multiplication operator, as its components do not 
commute, and with u>j as a differential operator the Hamilton constraint would not be well defined, 
as it contains a quadratic term in u>j. Hence, the wave functional is given by ^[t^i, u]. 

As the functional depends on Lie group and algebra valued variables, we introduce some notation 
to deal with derivatives with respect to such variables. Consider a function depending on an algebra 
valued variable v = v a ~/ a . We can then define the derivative of this function in the direction of "y a , 
which is simply given by 

Of course, we can insert any other matrix a E sl(2) instead of the gamma matrices. If the function 
is given as a power series in v, then we can calculate the derivative most easily using 

H a ^) v = a - ^ 

This is of course quite simple because the algebra is a vector space and we can take derivatives with 
respect to the coordinates. We get the same result if we define a matrix valued derivative operator 

Gr) =*-< < 4 - 3) 

\OV ' pq OV qp 

which is a derivative with respect to the matrix components. Taking the trace of this operator 
with some algebra element a reproduces ( |4.2| ). However, the derivative with respect to matrix 
components is only defined if the support of the function it acts on is (an open subset of) M 2x2 . If 



we only use expressions like (4.2), where a is a matrix which is "tangent" to the algebra (as a subset 
of M 2x2 ), then we can extend the function to be differentiated arbitrarily to M 2x2 before using the 
derivative (|4.3| ). The result will be independent of this extension, as we only take the components 
tangent to sl(2) cR 2x2 . 

Now consider a function depending on u £ SL(2). We can extend it to some open subset of M 2x2 
containing SL(2) and apply the matrix valued differential operator (4.3), with v replaced by u. We 
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are then only allowed to take those components of the derivative which are tangent to the group, i.e. 
we have to "trace" the result with ua or au for some a £ $1(2). These are the matrices tangent to 
SL(2) at u. Hence, there are two complimentary sets of derivative operators 

M au ^)> «^[(2). (4.4) 



Exercise 4-1- Show that ( [4.4Q are the left and right invariant vector fields on the Lie group, respec- 
tively, and that they can be defined on SL(2) as well, obeying 



Tr(ffa^L) g = ga, Tr [ag ) g = ag, a 6 sl(2) 



dg 



0_ 
dg 



(4.5) 



Note that here g is a treated as a matrix valued function of g, and the trace is a formal notation 
only. Moreover, the total differential of a function F(g) on SL(2) is given by 



dF = | Tr^-Mff 7o ) ^{pi a ^f ) = \ Tr(d ff g' 1 ^) Tr( 7 a flf^ 



(4.6) 



Here the first traces are "real" traces of matrices, with dg the differential of the special function 
g i— » g, whereas the right hand factors are the formal expressions (|i.5|). 

The derivative operators on the group do not commute, but they provide two representation of the 
Lie algebra, commuting with each other: 



Tr 



(l a i 



d_ 
dg 



Tr h b g 



d_ 
dg 



2e ab c TiH c g 



d_ 
'dg 



Tr(^|),Trf^A 
d 



Tr(j a g 



dg 



dg 



dg 



0. 



(4.7) 



The translation from classical Poisson brackets to operators is defined such that the latter is —\fi 
times the classical bracket. From fl3.29|) we get the operator representation for the dreibein 



Tr(a7j(x)) = -ifey Tr(a 



5uj (x) 



which gives the commutator 



[Tr{aji(x)),Uj(y)] = -ih£ija5(x,y). 



(4.8) 



(4.9) 



The operator representation for v is more complicated, but from the bracket ( [A. 20 ) we can read off 
that a suitable representation is 



Tr(a^ )) = m ^d0Tr(^(a,^-^»o)«(0)^y), 



(4.10) 



where A(a, (ft) is the sl(2) valued step function introduced in the appendix. We actually only need 
the following property of this operator: 



[Tr (av((f> )),u 



for < 



< 2vr, 



(4.11) 
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which follows from ( [A. 24 ), and we need the operator representation for 7^, which is by ( |3.33 ) 



Tr(a%(<t>)) = -ih Tc(«(^ - 2ttz) o. _ ). (4.12) 



We can now write down the quantized constraints. We define them as iTi -1 times the classical 
quantities. As a consequence, no extra factors of Ti appear in their algebra and they generate exactly 
the same transformations on the wave functionals as the classical constraints generated on phase 
space functions. The Hamilton constraint is a pure multiplication operator and looks like ( 3.35| ): 



H[n] = ih- 1 I d 2 xe ij Tr(rai^) + ih' 1 J2 { ^ Tr(n(w _1 ^w - w^)). (4.13) 
J e J 

The Lorentz constraint becomes a first order homogeneous differential operator. Here it is more 
suitable to use the from ( 3.37| ), to avoid spatial derivatives acting on functional derivatives: 

L[X] = -ih' 1 J d 2 xe ij Tr(AATj) + ih' 1 f ^ Tr ( X %) 

r J\ poo Js 

= / d 2 x Tr(AA— ) + V / dcj) Tr{uX— ). (4.14) 



Note that the sum in ( [4.12 ) has been used to change the range of the boundary integral, which runs 



over M instead of once around the circle only. 

Exercise It is now easy to see that all the classical properties of the constraints are preserved: 

1. there is no ordering ambiguity, i.e. the constraints do not contain products of non-commuting 
operators. 

2. the commutators of the constraints with any field operator exactly coincides with the classical 
brackets (|3~38l) and (|H). 



3. the algebra of the constraint operators is closed and given by ( p. 41 ). 



4.2 Physical states 

After defining the operators for the phase space coordinates, we now have to construct the physical 
state space, which corresponds to the classical reduced phase space. It is the set of all possible 
gauge-inequivalent states of our "universe" . 

Definition 4-1 (Physical State Space). The physical state space is the kernel of the constraint oper- 
ators. A physical state is represented by a wave function satisfying 

H[n] 9[uh,u] = 0, L[X] u] = 0. (4.15) 

Solving these equations is even simpler than finding the general solutions of their classical versions, 
as only half of the fields are involved. The first equation states that ^ is non-zero only for those 
configurations u>,u with = and u~ 1 d ( j,u = on the boundaries. The solution is some kind 
of "functional delta function", which, however, makes the treatment of the second equation a bit 
complicated. Hence, let us first eliminate all functional derivatives. We can do this by writing the 
Lorentz constraint equation as a functional equation instead of a functional differential equation. 



From (4.14) we infer that L[X] acts on the fields as 

L[X] un = A A, L[X] u = uX. (4.16) 
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and since it is a first order homogeneous differential operator we have the Leibnitz rule 



L[X] (FG) = (L[X]F) G + F (L[X}G), (4.17) 

where F and G are functions of uji and u. Thus, L[X] generates Lorentz transformations on the 
wave functional, and the constraint equation tells us that ^ has to be invariant under local Lorentz 
rotations. As we know how finite Lorentz rotations act on the fields, we can rewrite the condition as 

=V[h-\d l + u l )h,uh] (4.18) 

for any field h G SL(2) which is the projection of some SL(2) valued field h on J\f . Together with the 
Hamilton constraint 

tt[u*, u) = if F i:j /Oor^/ %T%u, (4.19) 

we get a set of constraints without functional derivatives, so we do not have to bother about differ- 
entiability of ^ any more. The two conditions are consistent in the sense that the inequalities in the 
second condition are preserved under Lorentz transformations, because the constraints are first class 
and form a closed algebra. 

To solve the Hamilton condition, remember that we know the complete solution for the equations 
Fij = and = u~ 1 d ( pu. They are uniquely parameterized by a normalized quasiperiodic field 
g : J\f —* SL(2) and an extra variable bi G SL(2) for each boundary of M: 

^i{x)=g~ 1 d i g{a x ), u{4>) =bi 1 g{a t ^). (4.20) 

Due to the quasiperiodicity of g we can insert any a x G M with ir(a x ) = x into the first equation. 
To get the second equations we needed the family of boundary paths ai^ from figure [|. Therefore, 
the complete solution to the Hamilton condition can be written as 



Mu 1 ,u] = \^9,{bi}] if (4.20) holds, (421) 
1 otherwise. 



From ( 3.6l| ) we know how Lorentz rotations act of the potential g and the boundary variables bi, so 
we can rewrite the Lorentz condition ( |4,18P as 

F[g,{b e }} =F[h 1 gh,{h 1 b e }}, (4.22) 

which must hold for any periodic field h : N —* SL(2), where h$ = h(x) is its value at the base point 
of J\f . Remember that this extra contribution to the finite Lorentz transformation of the potential g 
was necessary because g is normalized quasiperiodic. 

We found that two potentials g and g' are always related by a Lorentz rotation if there values 
coincide on the loop group T C N . Therefore, T is in fact a function of the values g\, A G T 
only. In addition, we know that the values g\ are not all independent, as they have to provide a 
representation of the loop group 

9\9k = 9\,k- (4.23) 

So we conclude that T can be written as a function of g\, A G To, which is the set of primitive loops 
introduced in definition |1.6| : 

F[g,{b e }]=F({gx\XeT },{b e }). (4.24) 

Here we have to be careful if the primitive loops are subject to relations as described in definition |1.6| . 
In this case, the support of T is given by those sets of elements {<7a} which fulfill the corresponding 
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relation in SL(2). In case of M being a torus, this would mean that the support of T is given by the 
set of all pairs of commuting SL(2) matrices, as from the relation A • k = k • A for the two primitive 
loops of a torus we get g x g K = g K fJx- 

Yet we have not solved the Lorentz condition completely, because the variables still transform 
under Lorentz rotations at the base point x, and T has to fulfill the relation 

T({g x }, {bi}) = Tdho'gxho}, {ho%}) (4.25) 

for any ho € SL(2). With definition |1.13| for inner automorphisms of SL(2), we can drop the tilde 
from ho here, as the transformation g ho l gho can equally well be written as g i— > ho l gho, where 
ho is the projection of ho onto SL(2). Similar to ( 3.69| ) for the classical holonomies, it is not difficult 



to do the last step and get a full classification of all solution to the constraint equations, provided 
that M has at least one boundary. Giving this boundary the special index t = o, we found that 
g ,\ = bo X g\b and b Q ^ = bo 1 be form a complete set of independent gauge invariant quantities. So 
the wave function depends on these variables only. The final result can be summarized as follows: 

Theorem 4-1- If the space manifold N has at least one boundary I = o, the physical state space is 
given by the set of wave functions 

= f ^({Ao.aIA e ToMM* + °» if ( 42 °) holds < (4.26) 
' 1 otherwise. 

If M has no boundary, the general solutions to the constraint equations is 

W M] = fH{9x\\ G T }) if (4.20) holds, , . 

[ li \0 otherwise, V ' ; 

where T is subject to the relation 

H{9x}) = Hih^Qxho}) for any ho G SL(2), (4.28) 

and the support of J- is restricted by relations present between the primitive loops. 

For physical states we will use Dirac's bra-ket notation, the states given above are denoted by \ J-). If 
there is no boundary, it is more difficult to find the complete physical state space explicitly, especially 
because of the relations between the primitive loops, and identifying equivalence classes of special 
sets of SL(2) elements can be rather awkward. Formally, a basis of the physical state space can be 
given in terms of the so called moduli space, which is defined as follows: 

Definition 4-2 (Moduli Space). The set of equivalence classes of group homomorphisms T — ► SL(2) : 
g\ modulo inner automorphisms g\ t— > ho~ l g\ho is called the moduli space of a manifold ftf, of 
which T is the loop group. 

If N has no boundary, every point in the moduli space exactly corresponds to a basis vector of the 
physical state space, represented by a wave function T that has support on this equivalence class 
only. Hence, physical wave function can be written as a function on the moduli space. Alternatively, 
the moduli space can also be defined to be the set of equivalence classes of maps To — > SL(2), obeying 
the relations present between the primitive loops. In this case it is obvious that the moduli space is 
finite dimensional. 



4.3 Observables 

Compared to ordinary quantum mechanics of point particles, we are now at the very beginning, 
where the state space is introduced as the set of wave functions ^[{xj}]. We now have to define 
operators acting on this physical state space, which have to be related to physical measurement. If 
an operator acts on the physical states space, it must map solutions of the constraint equations onto 
solutions again. It is exactly this what defines an observable: 
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Definition 4-3 (Observables). A phase space function or the corresponding quantum operator O is 
called an observables, if it commutes weakly with the constraints 

[L[X],0] « 0, [H[n),0] « 0. (4.29) 

Two such operators are considered as the same observable, if they are weakly equal. 



Exercise 4-3- The Poisson bracket or commutator of two observables, or any function of observables, 
is again an observable. 

At the classical level, (weak) vanishing of the Poisson brackets with the constraints means that O is 
(at least on the physical phase space) gauge invariant. At the quantum level, an operator vanishes 
weakly if it maps all physical states to zero, and observables are exactly those operators that map 
physical states onto physical states. It is reasonable to consider weakly equal operators as the same 
observable, because they take the same values on the physical phase space, or their action on physical 
states are identical, respectively, so they correspond to the same physical quantity. 



In section |3.3| we already looked for the classical gauge invariant phase space functions. As a 
result we found theorem |3.2| , which tells us that all observables can be written as functions of the 
holonomies gx and fx for the loops and be, q for the boundary paths. As we want the observables 
to be (real) numerical quantities, let us for a moment ignore that g\ is an element of SL(2) and take 



g\ G SL(2) instead as the basic quantity. Using (|3.59|) , we can express the holonomies directly as 
phase space functions: 

9x = U x , b i = U atA u~ 1 , 

fx = V x , c e = j ~ u ^u~ l vuU ai(f i. (4.30) 

Remember that ote^ denotes the family of boundary paths from figure [l], and, as indicated by the 
ordering, that the fields u and v are to be taken at the endpoint of the curve a^, i.e. at the angular 
coordinate on the boundary. The values of be and C£ as defined in ( |3.59| ) were (weakly) independent 
of the special value chosen for (j), so for later convenience we keep it arbitrary here. Similarly, the 
holonomies also depend on the special path representing the homotopy class. However, they always 



lead to the same observables in the sense of definition 4.3, because under smooth deformation the 
variations of U and V are proportional to the constraints. 

The holonomies themselves are not yet the observables, as they still transform under gauge 



transformations acting at x, which were given in (3.68). To obtain observables, we have to find 



quantities which are invariant under these transformations. In case of a manifold without boundary, 
we only have the loop holonomies gx and fx- To get an observable, we can take the trace of gx, 
which is of course invariant under inner automorphisms. Another gauge invariant quantity is the 
trace of g\f\, which is a matrix tangent to gx- We denote these observables by 

Ux = \ Tr(g x ), V A = Tr(g x f x ). (4.31) 

They provide gauge invariant information contained in the holonomies gx and fx, telling us whether 
gx is timelike or spacelike etc., so essentially they measures the type of the "sigularity" associated 
with the loop A. However, there are still some "exceptional" gauge invariant properties of gx which 
are not encoded in the trace. If, e.g., gx is timelike or lightlike, then the sign of Tr(gryo) is invariant. 
We will need those observables explictly to construct the state space of the torus. 

Exercise 4-4- Of course, the trace observables exist for manifolds with boundaries, too. Find out 



how Ux and Va are related to the parameters £, I, z used in section 3.4 to classify the solutions of the 
constraints on the cylinder manifold, and which additional observables are needed to determine £, I 
and z completely. 
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For the rest of this section we will consider a manifold with at least one boundary. With the 
results of section [D], it is not difficult to find out how observables in general look like. We have 
to take some loop A (including the trivial loop A = rj), and multiply the pair (g\,f\) with some 
boundary holonomies (but not necassary the same) from both sides to make it invariant under inner 
automorphisms. Hence, a pair of observables is given by 

(6* P*)" 1 ^,/*) (4.32) 

Obviously, we can label such an observable by a path {3 = al 1 • A • a^, whose end points both lie 
on a boundary of J\f (possibly the same). The special observables Q3.69 ), which form a complete 



set of independent observables, are included in ( 4.32| ) as those for paths starting at the special end 



£ = o and going directly to another end, or around some primitive loop and back to the boundary 
o. We can express the observables directly in terms of the phase space variables using the transport 
operator and distance of the path (3: 

(Up,Vp) = {u,v){fJp,Vp){n,v)- x . (4.33) 

Again, our convention applies and that the boundary fields (it, v) are to be taken at the beginning 
or end of (3, respectively. In components, the observables read 

Up = uUg U , 

Vp = v + uVpu~ 1 - uUp u^v uUp-iu' 1 . (4.34) 

The ordering indicates where the fields are to be taken: Vp has to be an s 1(2) matrix at the beginning 
of P, so the first v and the "outer" it's are sitting there, whereas the "inner" boundary fields are 
sitting at the end of (3. An important feature of all these observables is that they do not refer to 
the special point x any more. They are either the trace of a holonomy along a loop, which can 
be deformed arbitrarily such that it no longer passes through x, and the same holds for a path (3 
connecting two boundaries. In addition, their end points can be shifted along the boundary, as the 
definition of the boundary holonomies in (3.59) is independent of <f>. In principle we could require the 



paths (3 labelling the observables to start and end at (j) = 0, but let us allow to shift the end points 
to small positive or negative angles. We will not shift them more than once around the boundary, as 
this might course some trouble with the non-periodicity of the (it, v) fields. Some of these paths are 
shown for the cylinder manifold in figure ||. 

The Up and Vs are in some sense the smallest possible complete and "symmetric" set of observ- 
ables. More precise, they form the smallest set of observables such that every observable is a function 
thereof, but without marking any special set of "primitive loops" or choosing a special boundary like 
£ = o. As we already introduced these special objects to define the physical state space, we should 
better not use them again here, which might break the symmetries of our universe. For example, 
we will require all basic observables to become hermitian, not only the already complete set ( 3.69| ). 



Otherwise it could happen that only for special observables, which a priori cannot be distinguished 
from others, there are well defined eigenstates with real eigenvalues etc., whereas for others are not 
because they are not hermitian. 

The relations present between the basic observables are the same as those for transport operators 
and distances of arbitrary paths: 

(U a ,V a ) (Up, Vp) = (U a .p, V a .p), (4.35) 

which, of course, only holds if (3 starts at the boundary where a ends (but the linked path can be 
deformed away from this boundary). To discuss the problems arising from quantization of these 
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observables, it is more convenient not to use the IS0(1,2) notation and switch back to the vector 
representation of the sl(2) valued variables. Hence, the observables are 

Up = uUpu~ l £ SL(2), 

Vp a = \ Tr( 7 a V Q ) = v a + u a b Vp b - u\ Up\ u d c v d G M 3 , (4.36) 

where u% = ^ Tr(ii _1 7 a ti7b) and v a = \ Tr(i?7 a ) are the vector representations of the boundary 
fields, obeying [u~ x Yd = u d '■ They directly correspond to basic physical quantities, which will 
be discussed in more detail in the next section: Up discribes the transport of a spinor from one 
background frame into another (or back into the same along a non-trivial loop), and Vp a measures 
the relative position of two background frames. In components, the relation Q4.35 ) becomes 

U a .p = U a Up, V a .p a = V a a + U a \Vp b . (4.37) 

The task is now to find the quantum representation of these observables such that (if possible) these 
relations as well as their classical Poisson brackets are preserved, and such that we can define a 
scalar product on the physical state space which makes them all hermitian. Let us start with the U 
observables, as this is quite simple. 

Of course, they all commute, because they depend on Ui and u only, and they are represented by 
multiplication operators acting on the wave function J~({g ,\}, {b ,l})- In particular, the observables 
associated with the variables on which the wave function depends act as 

U a -i.x»a Q \?) = \ 9o,\F) , U a -±. ai \F) = \ b o/ F) . (4.38) 

All other U observables can be obtained as functions thereof, and because they all commute it will be 
sufficient to render the observables ( [4.38 ) hermitian, then a real function thereof becomes hermitian 



as well. 

As a consequence, the eigenspaces of these operators with different eigenvalues have to be or- 
thogonal. However, these eigenspaces are still degenerate, as the U observables fix the values of 
g 0) x but not that of g Q \. There should be an observable associated with the winding number of g\. 
Introducing such additional observables, the eigenspaces would become non-degenerate, provided 
that we want to make them hermitian operators and their eigenstates normalizable. However, this 
is reasonable only if they really corresponds to physical measurements, i.e. if an observer inside the 
universe can measure them. We will see that this is not necessarily so, and, e.g. for the torus, the 
winding numbers do not correspond to any physical measurement. This will also have some effects 
on the scalar product, whose definition becomes rather a physical than a mathematical problem. So 
let us define the scalar product in the next section, after clarifying the role of the winding number 
as a physical observable. 

To find the action of the V observables on the physical states, we will exploit their commutation 
relations. To get the commutator of some V a a with Up, we proceed as follows. The non-commuting 
fields are the dreibein 7, appearing in V a a through V a b , which is conjugate to the spin connection uJi 
in Up, and the v a and v d boundary fields in V a a , which do not commute with the it's in Up. However, 
we can deform the paths such that all boundary fields commute. What we have to do is to shift the 
endpoints of ft to a small positive angle (f> = e, thereby keeping the endpoints of a at (p = (if the 
two paths do not meet at a boundary, this procedure is not necessary). As v(0) commutes with u(e) 
for < e < 2tt, the only remaining commutator to be computed is 

[V a a ,Up] = [u a b V a b ,uUpu- 1 ]. (4.39) 

Hence, what we actually have to compute is a commutator of a distance of one path with the transport 
operator along another path. The distance is explicitly 

V b = \ /' 1 d S a , X S )Tr(7 6 J7 a (0, S )7 i C/ a ( s ,0)). (4.40) 
Jo 
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Figure 3: The solid line corresponds to a distance operator with an index b sitting at its start 
point, the dashed line to a transport operator. For each intersection there is a contribution to the 
commutator, consisting of a transport operator that "picks up" the gamma matrix. The "sign" is 
^ifr if you have to turn right at the intersection, and — ^ih if you turn left. 




[V a b , Up] U p (0, s) U a (s', 0) 7 6 U a (0, s') U (s, 1) 



Obviously, there might be ordering ambiguities, because we can place the dreibein equally well to the 
left or right, but this does not affect the commutator to be computed here. A useful general formula 
is 

[ei a (x),U a (a,b)] = -\ih f e<te i jc! i {t)8{xMt))U a {a,t)1 i U a {t,b), (4.41) 

J a 

which follows from the basic commutator relation Q3.31|) and the functional derivative of the transport 
operator with respect to the spin connection, which has been given in ( |2.19| ). Inserting this, we find 

[V a b , Ub] = - |m f ds [ dte ij a' i {s)(3' j {t)8{a{s) } (3{t)) x 
Jo Jo 

U p (0, t) U a (s, 0) 7 b U a (0, s) U p (t, 1). (4.42) 

Here we get a contribution to the commutator whenever the two paths intersect. The term produced 
at each intersection consists of the transport operator along (5 to the intersection point, then along 
a -1 to the beginning of a, where it "picks up" the matrix "y b , then back to the intersection and 
further along (3. Figure || shows a picture of this operation. The sign of each contribution depends 
on the direction in which the paths intersect. The picture also shows the normalization of the e 
tensor: we defined e p( j, = +1, where p is the coordinate pointing towards a boundary and cj) runs 
clockwise on the boundary. For the curves a and (3 in the picture we then have EijcJ % $ 3 < at the 
intersection, so that the factor in front becomes +^ih. What remains to be shown is that the delta 
function contributes a factor ±1 at each intersection, the sign depending on the direction in which 
the paths meet. More general: 

Exercise 4-5- 

[ ds f 1 dte ij a' i {s)f3' j (t)8{a(s),p{t)) (4.43) 
Jo Jo 

gives an integer that counts the number of directed intersections of the paths, where an intersection 
is positive if a "comes from the left" . 

An illustating example for this graphical method to compute commutators will be given in the next 
section for the cylinder manifold. What can be shown quite general is that the U and V observables 
for the same path do not commute: 
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Exercise 4-6- Using figure [|, compute the commutator of V a a and U a (with the same path). Re- 
member that you have to shift the ends of the U path clockwise by a small positive angle at the 
boundaries, which produces one intersection between this and the V path if a connects two different 
boundaries, but two intersection points if a starts and ends at the same boundary. The result should 
be 

h; a 7 , i _ / -\\h(YU a - U a -f a ) for loops a, , , 

[V a ,Ua\ - < _i ihYUa otherwise. (4>44) 



As a consequence, show that 

[Va a ,Ua,a h }={ [n£hCdUa ' Cd fo^ loops Q, (445) 

1 otherwise. 

We will not derive the commuator algebra of the V observables explicitly, but we can see how they 
look like in general: 

Exercise 4- 7. Show that the set of all observables (U, V) which are linear in V form a closed alge- 
bra under the Poisson bracket and a graded algebra together with the observables 0^(14) depending 
on U only, i.e. 

{O(i) ) 0(i)} = O(i) ) {e>«e>(°)} = e>(°), {0C°),0(°)} = O. (4.46) 

Including higher order homogeneous polynomials of degree n in V the observable algebra becomes 

(C)( m ) ) = £)("H-n-l)_ (4.47) 

To find the action of the V observables on physical states, we first have to fix the operator ordering 
somehow. The simplest possible choice would be to place the differential operators to the right. This 
would define a unique quantum operator for every observable and the algebra ( |4.46|) would be 
preserved. In addition, the relations ( 4.371 ) would hold, too. However, the last two factors U a % 



and Vg in this relation will in general not commute, so it cannot be a relation between hermitian 
operators. Hence, with this simple ordering prescription we cannot have all V a a operators hermitian, 
whatever the scalar product is. Instead, we should have 

V a .fs a = V a a + \U a \ V/ + \Vp b U a a b . (4.48) 



Exercise 4-8. Show that this relation is still associative, i.e. V a> ( i g. 7 ) a = V( a>( g). 7 a . 

This suggests another prescription. Suppose that a and (5 are two paths such that there is no ordering 
ambiguity in V a a and V/3 b (assuming that such paths exist, see below), but there is one for a • j3. 
Then ( 4.4§| ) already fixes the ordering for V a9 p a , which is given by the "symmetrized" operator, i.e. 



^(differentials right) + J, (differentials left). (4.49) 

With this prescription, we get a unique quantum operator for any (and of course also for any 
0(°)). To check whether this is a suitable ordering, let us introduce the notation rO and lO for 
the quantum operators corresponding to a classical observable, with the differentials ordered to the 
right or left, respectively, including a factor \ to simplify the notation. The quantum operators are 



then defined by O \—* rO + lC What we have to check is that the relation ( 4.48 ) holds for these 
operators. The left hand side becomes 

= R V a a + L V a a + U a \ R Vp b + L V p b U a a b . (4.50) 
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This must be equal to the right hand side 



R V a a + L V a a + \U a \ ( R V + L Vp b ) + \{ R Vp h + iy P h )U a \. (4.51) 
The difference between the two expressions is obviously one half of 

[Ua a b,RVp b ] + [LVp b ,U a a b], (4.52) 
which vanishes because the commutator is independent of the ordering chosen for Vg b . 



Exercise 4-9- Show that with the prescription ( 4.49| ) the classical Poisson bracket of the observ- 



ables ( 4.46 ) is preserved, i.e. 

[ r O + zA rO' + l O'\ = -ih( R {0, O'} + L {0, O'}), (4.53) 
for observables 0,0' = CK^CK 1 ) linear in or independent of V. 

This is the best we can get: a unique quantum operator for each classical observable that is linear 
in the momenta, and their classical Poisson algebra coincides with the commutator relations. In 
addition, the relations between the V a a do not prevent them from being hermitian. For higher order 
observables in V there are no unique quantum operators. This is well know from quantum mechanics, 
where unique hermitian operators for higher order polynomials in p and x do not exist. 

Finally, we have to find out how the V operators act on a physical state. Before doing this, let 
us check whether there are ordering ambiguities at all (otherwise we could forget everything we just 
did). Fix some path a connecting two ends or starting and ending at the same end. We use the same 
trick as above to simplify the computation. We want to avoid products of non-commuting boundary 
fields which are somewhat awkward because of the stepfunction appearing in their commutator. This 
time we have to shift the beginning of a to a small positive angle cj> = +e, wheras the end of a remains 
at = 0. We then have 

V a a = v a {e) + u\(e) V b - u\(e) U a b c u d c (0) v d (0). (4.54) 

The only product of non-commuting boundary fields in this expression is u d c (0)v d (0): the commuator 
of v a (0) with u a b(e) either vanishes because the fields are sitting at different boundaries, or because 
for < e < 2tt their commutator is zero by ( 4. 11] ). Interchanging u d c (0) with v d (0) gives 



[u d c ,v d } = l[Tr(u- 1 7d u<y c ),Tr( 7 d v)] 

= - 1 I mTr(u- 1 l ld ,A(~/ d ,0)}u 1 c ), (4.55) 

where the operator representation ( f4.10| ) has be inserted. Unfortunately, the value of the stepfunction 
is ill-defined (but not infinite) at (ft = 0. Nevertheless, if we regularize it in either direction, we get zero 
for the commutator in both cases, as A(j d ,+e) = and A(j d ,—e) = —•~f d (see (]A.22| ) and QA.24Q ). 



Hence, the only possible ordering ambiguity occurs inside Vg b , which depends on the dreibein and 
the spin connection only. If we change the ordering in ( 4.40| ), we always produce extra terms of the 
form 



2 



f 1 ds a H (s) Tr(y b U a (0, s) lc [e 4 c (a( S )), U a (s, 0)]). (4.56) 
Jo 



Exercise 4-10. There is no ordering ambiguity in V a a , i.e. RV a a = L^a a , if all self-intersections of 
a can be removed by smooth deformations, after shifting a such that it starts at a boundary point 
with (j) = e and ends at 4> = 0. 
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This leads to a rather peculiar result if a is a loop, i.e. it returns to the same boundary where it 
started: because we have to shift the beginning to a small angle e before looking for self-intersections, 
V a a and V -i cannot be both free of ordering ambiguities, as one of them will have a self-intersection 
right at the beginning (where it meets its own end). This also follows from exercise 4.6, as by ( 4.48] ) 
we have 

V^x = - |Z4/ V a b - \V a h U a /, (4.57) 

but the two factors on the right do not commute if a is a loop. To obtain the action of V a a = 
i?V a a + LV a a onto physical states, we introduce a special state |1) given by F = 1. Obviously, we 
can then write every state as 

\F)=F\1), (4.58) 



where F is a function of the observables ( |4.38j ), but possibly also of g 0t \ (see below). In addition, we 



have R V a a \l) = 0. To see this, take the complete wave functional ( |4.26| ) , which can be considered as 
a phase space function. When the differential operators are ordered to the right, the action on ^ is 
given by the Poisson bracket 

R V a a n^u\ = -ih{V a a ,y[ui,u}}. (4.59) 

For the special state |1), is a functional depending on the Hamiltonian constraint H only: it is 
simply 1 if H = everywhere and otherwise. Now the bracket of V a a with H vanishes exactly (not 
only weakly), and therefore the bracket of V a a with any function of H vanishes, too. In addition, 
we can write 

V a a = 2 R V a a + ( L V a a - R V a a ). (4.60) 

The term in the parenthesis is in fact an observable depending on and u only: it is the commutator 
of the differential operators appearing in V a a with the multiplication operators therein. It also 
commutes with the constaints, so it must be an observable depending on IA only. Hence, we have 

V a a | F) = 2 R V a a \ F) + ( L V a a - R V a a )\ F) 

= [2 R V a a ,T] 1 1) + ( L V a a - R V a a )\ F) 

= \[V a a ,F] + ( L V a a - R V a a )F), (4.61) 

because the commutator of V a with F as well as the difference in the parenthesis is a multiplication 
operator. 

Exercise J^.ll. Here we actually need the commutator of an V a a with a function of g \, but so 
far we only have the commutator with the matrix g ^\ = U a -i.x»a an d functions thereof. Using 



exercise LI, show that the commutator with a function F(g), where g is some SL(2) valued function 



of uJi and u (like, e.g. g ,\), is given by 

[V a a ,F(g)] = \Tr{g-\V a \g] la )Tr(g 1 a ^). (4.62) 

We can interpret the last term appearing in ( f4.61| ) as some kind of "quantum correction", which 
is a multiplication operator of order Ti acting on the wave function. It vanishes for paths without 
self- intersection and has to be introduced for those with, because otherwise the relation ( fi.48 ) would 



not be satisfied. We do not have to compute this correction term explicitly if we want to get the 
action of V a a for a path with self-intersections. It is always possible to find a set of "primitive" paths 
(not necessarily identical with those used in ( |4.38j ) ) , which are free of self-intersections and such that 
every path connecting boundaries can be obtained by linking them. Then we can derive all V a a 
operators using ( 4.48| ). Moreover, we can use this relation to proof any kind of property of the V a a 



operators by induction. For example, it is sufficient to render the observables for the primitive paths 
hermitian, for the rest it then follows automatically. 
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5 Quantum Physics 



So far we have a physical state space and know how the observables act on the states. This alone, 
however, does not tell us very much about physics, as long as we cannot compute expectation values 
and things like this. A physical question which does not make use of the scalar product is: how does 
the universe in some state \J-) evolve in time? Unfortunately it doesn't evolve at all, because the 
Hamiltonian is a linear combination of constraints, and by definition it annihilates every physical 
state. This is obvious also because the wave function depends on gauge invariant objects and therefore 
constants of motion only. In some sense we are dealing with a Heisenberg picture, but the observables 
acting on the physical states are time-independent as well. This is the famous "problem of time" in 
quantum gravity 0, |3], |2q| : the state as well as the operators are time-independent, but nevertheless 
they represent a non-static universe in general. The question is how to reconstruct the time evolution. 
To investigate this in more detail, and to clarify the role of our observer in quantum physics is the 
aim of this section. 

In principle, the problem of time is also present in classical canonical gravity, where the state of 
the universe is given as a point in the reduced phase space (see definition 3.2). Strictly speaking, a 



classical state is a point in this space, which is a gauge invariant object just like the quantum state, 
and it does not evolve in time. In the classical theory it was quite easy to recover all the physical 
structures including time evolution. The reduced phase space was defined to be a set of equivalence 
classes of dreibein and connection fields. Hence, we could simple choose a particular representative 
for each class to see how the corresponding state looks like. That's exactly what we did in section |3.4| . 
For the quantum wave function this is not possible: it is just a function on the configuration space, 
sometimes called the "wave function of the universe" . The only way to recover things like "time" and 
to make physical statements is to formulate everything in terms of gauge invariant "observables". 
This is a well know feature of theories with an unphysical time parameter (see, e.g., [18, 29| for 



a general discussion). What we need to get a time evolution is a special phase space variable that 
behaves like a clock. The time shown on this clock is a dynamical variable rather than an background 
parameter, because the physical time is a dynamical quantity in general relativity. A time evolution 
can then be recovered be reading off the clock whenever a measurement is made. In fact, we already 
have such a clock, and using this will "solve" the problem of time in our toy model. 

5.1 Astronomy, clocks, and time evolution 

The main idea to solve the problem of time is to consider time as something that exists for an observer 
inside the universe as a dynamical quantity, but which has no meaning to somebody looking at the 
universe from outside, who only "sees" the quantum state, or the point in the reduced phase space. 
So let us return to our observer x sitting inside the manifold and making astronomical observations. 
A nice feature of this point of view is that it is possible to consider the same observer both as 
the relativistic observer (making experiments with test particles, clocks etc.) and as the quantum 
observer (interpreting the wave function as probability amplitude). We will also see that Dirac's 
formalism is a very convenient way to describe this situation. It will easily keep apart and show the 
relations between the objects related to "the universe", which is, e.g., the quantum state and the 
observables derived in the last section, and quantities which are related to a single observer. These 
"local" quantities are the holonomies ( |4.30| ) . They are associated with paths starting at the special 
point x and, from the physical point of view, they correspond to astronomical measurements. To see 
that some of them may be used as clocks, we have to find out their physical meaning. 

As seen from the observer x, g\ is the transport operator along A back to himself, and f\ is 



the distance of this loop, which both can be measured as described in section 2.3. However, g\ was 
actually a projection of an SL(2) valued holonomy g\, which cannot be measured by passing spinors 
along loops. To distinguish between different values of g\ corresponding to the same g\, the observer 
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has to use another method. From section 2A we know that different values of g\ really correspond 
to different physics, in that case to different deficit angles of the conical singularity. So it should 
be possible to measure this from inside the universe. In fact, W6 need. th.6 '"/oo field from (gl| ), 
or, equivalently, we have to now how the physical dreibein behaves at infinity. Without going into 
details, here is the trick how to measure gy. 

Exercise 5. 1 . Assume that A = Xi is the loop going once around an end as shown in figure [I]. The 
observer x now asks all the observers on the paths oti^ to send him the value of 7oo along these 
paths. As a result, he can define 

7oc0,s) = U ae4> (0,s)'j oo (ae i(j) (s))U ai(j> (s,0), (5.1) 

which certainly depends on the special paths and even their parametrization, but which always 
diverges to spacelike infinity for s —* 1. Choosing s large enough such that ~yoo(4>,s) is everywhere 
spacelike, he can count how often it winds around the 70 axis when 4> increases by 2tt (which is not 
necessarily an integer). This is independent of the parametrization of the paths and also invariant 
under finite redefinitions of the 700 field. Show that this winding number is twice the winding number 
of g\, interpreted as a path in SL(2), around the non-contractible loop S0(2) C SL(2). On the other 
hand, if A is not a loop winding around a boundary, and especially if there is no boundary at all, then 
there is no way to measure g\, which uses only the background structure given and is in agreement 
with general relativity. 

Hence, though from the mathematical point of view the winding numbers are observables, we will 
not consider them as physical observables in the sense that they can be measured by an observer 
inside a universe without asymptotic ends. This will have some important consequences for the scalar 
product and the state space of the torus. However, for manifolds consiting of a sphere with some 
holes, including the cylinder as our standard example, it is in fact possible to measure the winding 
numbers of the holonomies. 

What is the physical meaning of the boundary holonomies bg and cp. In ( 4.30[ ), the SL(2) valued 



matrix bf is obtained by "pulling" the field u from the boundary along the path a£ to x. Assume 
there is somebody sitting in the background emitting spinors x "at infinity" such that they appear in 
a state u~ 1 ((f))x at the boundary of M, and move along the path a^Z 1 . Then x will find them in the 
state biX- Let us ca h this "somebody" a fixed stars background, or you can also think of a microwave 
background, with "microwaves" constisting of spinors. Then it is the relative orientation of the local 
frame of x with respect to this background that is encoded in bg. The vector representation bg :a b , 
which gives a proper Lorentz rotation, actually tells us how the local frame of x is rotated and boosted 
against the background. This can be measured via some kind of "generalized redshift" of particles in 
the microwave radiation, provided one knows in which state they are emitted (the redshift is nothing 
but the Lorentz-transformed wave vector of the photon, and if the microwaves consist of spinors of 
know polarization, then measuring the "redshift" is exactly what has been described). 

There is also an interpretation for eg G -Sl(2) or the corresponding vector cg a, y a = Cg. The 
expression for q above looks very much like the "addition rule" ( |2.43| ) for distances. It is the geodesic 
distance to the boundary, minus a vector that is obtained by pulling u l vu from the boundary to 
x. Hence, if v is interpreted as the relative position of the fixed stars background with respect to 
the boundary, then eg measures the relative position of x with respect to the background, or simply 
the geodesic distance to some special event in the background, a supernova, say. Remember that V a 
is not really the physical distance but V a given by ( 3.14| ), which of course diverges at the boundary, 



whose physical distance is infinite. Nevertheless eg can be interpreted as the physical distance to 
some special event. For simplicity, we can assume that ~foo(x) = 0. Then you can think of v as some 



renormalized "bare" field v such that the last equation in ( 4.30 ) is a regularized version of the same 



expression with V replaced by V and v replaced by v, which both correspond to real physical but 
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infinite quantities: both the supernova and x are infinitely far away from the boundary but their 
relative position is finite. 

As this "relative position" of x with respect to the background represents a position in both space 
and time, the variables C£ essentially become clocks, measuring the time elapsed since the supernova. 



We can infer this already from the time evolution equations ( 3.72 ) for free falling observers. In 
this case the spacelike part of q is constant (the place where the supernova took place) and the 
time component increases linearly in t. It is not unusual that in a way like this time reappears 
automatically. It is a general feature of canonical gravity on spaces that are either non-compact or 
have boundaries, that additional physical degrees of freedom appear, which are normally not necessary 
to introduce if one is just dealing with Einstein's field equations. Sometimes they arise from extra 
boundary fields being introduced to obtain a well defined Lagrangian, like in our case. In other 



examples, like, e.g., the quantization of spherically symmetric gravity in four dimensions [3C, plf , 
they can also arise from boundary conditions to be imposed on the multipliers and therefore on the 
parameters of gauge transformations. Then some of the symmetries normally considered as gauge 



symmetries no longer fit into the precise definition 3.1. As a result, there are more gauge invariants 
than expected. In the case mentioned one finds that in addition to the Schwarzschild mass, the 
only "classical parameter" of spherical symmetric vacuum Einstein gravity, there is a canonically 
conjugate variable representing the "time at infinity". It is a "conserved current" corresponding to 
time rescaling symmetry at infinity, which in classical gravity is considered as a gauge transformation, 
but which cannot be generated by taking Poisson brackets with constraints. 

How can one use the clocks to recover a time evolution? Without going into detail, let us sketch 
the main idea, using point particle quantum mechanics as a simple example once again. For a free 
particle we can introduce the first order action with parameter time L = xp + TE — H, where the 
Hamiltonian H = qC = q(p 2 /2m — E) with Lagrange multiplier q is, like in gravity, given by a "linear 
combination" of (only one) constraint C. All the variables depend on t, and the equations of motion 

2 

x = —, p = 0, f = q, E = 0, C = — -E&0 (5.2) 
m 2m 

obviously describe a particle in one dimension and a clock T, where the free parameter q indicates 
how fast the physical time runs with respect to t, and the conjugate variable to T is the energy E. 



The constaint C generates a gauge transformation in the sense of definition |3.1| , whose finite version 
has a real parameter £ and reads 



x®=x + ^, r^T«)=T + £, (5.3) 
m 



where the superscript (£) denotes the phase space function obtained by a finite gauge transformation 
with parameter £. The reduced phase space consists of equivalence classes of quadrupels {x,p, T, E} 
modulo this transformation. Canonical quantization is staightforward, we have a wave function 
*[x,T] and 

A physical state has to satisfy 

which formally looks like the Schrodinger equation, but T is not the canonical time coordinate t. 
There is no time evolution with respect to this coordinate: the state is a fixed function of the 
configuration variables x and T. To recover time evolution, one has to ask questions like: what is 
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the value of x "when" T takes the value To. At the classical level, the answer can be given as follows. 
The state is an equivalence class as described above, so choose a particular representative with the 
property T = To, then the value of x for this representative is what you want to know. We can define 
a corresponding phase space function 

x\ T=To = | d£ ^ 6(T® - T ) x«) 

= / d^(T + £-T )(x + ^) =x + (T -T)-^. (5.6) 

Note that (for a fixed value of To) this gives a phase space function which is in fact an observable: 
its bracket with the constraint C vanishes. We obtained this quantity by intergrating over the gauge 
group (R, +), picking up only those configurations with T = Tq. The extra term <9T^/<9£ has 
been introduced to get the correct factor, because the intergral runs over the gauge parameter £, 
wheras the argument of the delta function is a special phase space function, the "clock" T. It is 
the interpretation of T as a clock that makes the last expression look like a time evolution of x. In 
quantum physics we can now ask for the expectation value of x "at the time" T = To, and we just 
have to insert the corresponding operator 

(* \(x\t=t )\ *} = (* \{x + (To - T)£)| ). (5.7) 

But what is the scalar product? It has to be such that all real observables become hermitian, and 
up to rescaling there is only one such product: 

($| q>) = J dx<f>*[x,T]V[x,T]. (5.8) 

There is no integration over T, as otherwise physical states would not be normalizible. The value of 
the product is independent of the special T inserted, which follows from the "Schrodinger equation" 
C$ = Ctf = 0. 

Exercise 5.2. The scalar product is defined on the physical state space only. Show that it is not 
possible to define a product on the whole state space such that the operators x,p, T, E are hermitian 
and restricting it to the physical state space gives (|5lj|). 

Now we can compute the expectation value 

<*|(x|t=t )|*> = /dx(x**tf + ^(T -T)**^). (5.9) 

Knowing that this is independent of T, we choose T = Tq and what we get is the usual expression 
for the expectation value of x at a time To. Hence, formally the wave function ^[x,T] behaves 
exactly like a solution of the time-dependent Schrodinger equation, but T is not the canonical time. 
Neither the state nor the observables depend on the parameter time t, which totally disappears from 
the canonical framework. Nevertheless there is a time evolution, which is encoded in \&[x,T] as a 
correlation bewteen T and x. The state does not make any predictions about a measurement of, say, 
T alone, the result is just random. If you read off a clock for the first time in your life there is no 
"expectation value" for the result. The same holds for x, which might be considered as the position 
of the sun in the sky, which is completely random if you see it for the first time, too. However, there 
is a correlation between the two measurements, and it makes sense to ask for the position of the 
sun seen by somebody who just read off a specific time To from a clock. It is exactly this what the 
observable ( |5.9| ) represents. 

What does this mean for our gravity model? Here we have some more gauge degrees of freedom 
than just the time rescaling above. It is not enough to ask for the value of, e.g., a holonomy fx at 
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a specific time, we also have to specify where the observer is when he makes this measurement. It 
turns out that the following is a reasonable question: what is the value of some function X of the 
holonomies ( |4.30| ) for an observer being in a relative position c D = c and orientation b = b with 
respect to the supernova in the background I = o. Similar to ( |5.6j ) we can define a corresponding 
observable 

/p./ (h,n) i (h,n)\ 
dhdn ^ '°° > 5(c^ - ^(b^b" 1 ) (5.10) 
o(fi, n) 

We are integrating over the gauge group (h,n) £ ISO(l,2) acting on the holonomies. According to 
( |3.68 ) the transformed holonomies are 

b {h, n ) = h -i bo c (h, n ) = h-ifa _ n ) h _ (5.H) 

The first term in the integral is the Jacobian of this transformation, which becomes one if we choose 
the measure to be a Haar measure on ISO(l,2), and define the delta functions with respect to this 
measure. Independent thereof we can solve the integral, as the delta functions pick up exactly one 
term, namely that for 

h = b b 1 , n = c -b b ^cbbo 1 . (5.12) 
As a special example choose X = f x , so that by ( |3.68 ) we have 

f{ h ' n) = h- l (f x + g x ng x - x - n)h. (5.13) 
Inserting all this, we end up with a sightly complicated expression 

/A|bo=b = bbo 1 (f\ + g\{c Q - b D b ^cbb'^gx 1 - c D + b D b ^cbb' 1 ) b D b 1 

c — c 

= bf QtX b 1 - bg Q , x b l cbg Q , x l b 1 + c, (5.14) 

where g Qj \ and f Q: \ are the gauge invariant quantities introduced in ( p. 69 ). Hence, the result is 
indeed an observable, as b and c are just numerical parameters. But where is the time evolution? 
It is not so obvious as in our point particle example, but we have to remember that an observer is 
actually represented by a worldline, so its relative position with respect to the supernova, i.e. the c 
parameter above, will change. Assume that the observer is free falling, then by ( |3.72 ) his position 
and orientation with respect to the background is given by 

b(r) = b, c (t) = c-t 7 o, (5.15) 

where r is his own proper time. Once he found out which values of b and c apply to him, there is 
an observable obtained by inserting ( 5.15 ) into ( 5.14| ), which is parameterized by x's local time 
and obeys 

^ f ( x T) = b~9o,\b- l l* bgo^b 1 - 7o . (5.16) 

This is the time evolution of the holonomy fx as seen by x. It becomes a well defined gauge invariant 
phase space function or quantum observable after specifying the values for b and c, which can be 
obtained by measuring the geodesic distance and the "redshift" of the supernova in the background 
once at r = 0. Hence, for every special observer we can define observables describing the time 
evolution of the universe as seen by this observer, but there is nothing like the time evolution of the 
universe as a whole, expressed in terms of gauge invariant quantities. 
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Figure 4: The basic observables of the cylinder manifold. The wave functional depends on the two 
transport operators g and b corresponding to the paths g and in the picture. The observables are 
constructed from the holonomies corresponding to the paths a Q , ot\ and A, all starting at the special 
observer x. 



Q 




A 



Now this consideration was rather classical, but is should be clear how to transform it to quantum 
physics. As there is no prediction about the values of b and c to be measured first, there is also 
nothing like an uncertainty relations or something like this between them, though the holonomies b Q 
and c Q do not commute. The result of these measurement are random anyway, so x can measure 
them and he will get some values. Then becomes a well defined operator on the physical state 
space and one can compute an expectation value of this observable, or that of an projection operator 
onto some eigenspace of (provided one has the scalar product, which will be derived in the 
next section). The interpretation of this expectation value is the probability for the observer on the 
worldline given by ( 5.15| ) to measure a specific value of fx at a time r. This probability will in general 



depend on r, so the special observer x sees a non-static universe. 
5.2 Hermitian operators and the Hilbert space 

After clarifying the physical meaning of the holonomies and observables, let us come back to the 
state space and the last problem to be solved, namely the construction of the scalar product. We 



already mentioned it in section 4.3, but we saw that it depends on which mathematical observables 
we want to become physical observables represented by hermitian operators having normalizable 
eigenstates. In particular the possibility to measure the winding number of g\ will play a crucial 
role. By exercise |5.1| we now know that it depends on the existence of a boundary of the space Af 
whether g can be measured or not. We will give examples for both cases, the cylinder as a manifold 
with boundaries and the torus as the simplest without boundary in the next section. 

Figure || shows a picture of the compactified cylinder manifold. Instead of drawing a cylinder, it 
is more convenient to consider it as a sphere with two holes, and the picture shows part of the sphere. 
On the two boundaries "o" and "i" we have the angular coordinate </> running clockwise (note that 



on the "negative r" end this is not the <fi coordinate used in sections 2.4 and 3.4: here we need both 
coordinates running clockwise, as otherwise the formulas for the brackets of the boundary fields u 
and v would not be correct). We then choose a point x inside Af as our observer, and select two 
paths a Q and ct\ from x to the boundaries at <j) = 0. They provide the family of boundary paths as 
introduced in figure 0, but we will actually only need these two. In addition, there is the primitive 
loop A. 
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The wave functional of a physical state of the cylinder, as shown in theorem 4J., is an arbitrary 
function of the two quantities g = g Qi \ and b = b 0) i, corresponding to the paths g and (3 in the 
picture, which are already shifted to a small positive angle e to compute the commutators below. To 
obtain the action of some observable U a on a state \F(g, &)), we only have to deform the path such 
that it becomes a combination of g and (3, then the wave function is multiplied by the corresponding 
combination of g and b. For example, we have aY 1 • A z • at\ ~ j3~ l • g z • and therefore 

U ar u x *. a . | F) = Up-i.^.p | F) = | b- x g*bF). (5.17) 

As we want all these observables to be hermitian, the scalar product has to obey 

(F I 3^2) = (gF I ^" 2 >, I bF) = (bF I F). (5.18) 

Or, equivalently, the eigenstates for different values of g or b (represented by delta functions in g and 
b) have to be orthogonal. So far we saw this already for a general manifold, but now we can apply 



exercise 5.1, i.e. we know that there is another observable that measures the winding number of g\ 
and hence distinguishes between the different g corresponding to the same value of g. Of course, we 
want this observable to be hermitian, too. As a result, all the eigenspaces become one dimensional, 
and to make them orthogonal the scalar product must be 

(F\F) = J dgdbg(g,b)F*(g,b)F(g,b), (5.19) 

where Q is a function to be adjusted such that the conjugate observables become hermitian as well. 
The measure chosen for this integral is in principle arbitrary, as any redefinition can be absorbed 
into the function Q, but the most natural choice is to take a Haar measure on SL(2) and SL(2), 
respectively. 

To fix the function Q, we have to compute the action of the V observables. As we know from 
section |43|, we only need to compute the action of two special V observables corresponding to two 
independent self-intersection free paths, and make them hermitian. Their action on physical states is 
obtained by ( 4.6l| ), where the "quantum correction" term vanishes. So what we need are two paths 



such that they have no self-intersections and, if possible, simply commutators with g and b, so that 
the commutator with J-(g, b) becomes simple, too. A good choice is 

7 = Oq 1 • A -1 • a?i, ? = aT 1 • A -1 • a\. (5.20) 

The first is obviously self-intersection free, as it passes from one boundary to the other. The second 
is a loop, but if we shift its beginning, as required in exercise ETl(| to <f) = e (see fi gure H), all 
intersections can be removed. Here it is essential that A -1 appears and not A, which whould lead to 
a self-intersection. The reason for choosing these two paths is that each intersects with one of the 
paths (3 and g once only, so we get simple commutators. With the graphical rules from figure || we 
have 

[V, a ,Up] = h^UpY, [V y a ,U g ] = ±i?iW 7 7 a . (5.21) 
This immediately leads to the representation (using ( 4.62; )) 



V ? a |^) = |imTV(b 7 a |^)), v 1 a \F = \\^{gi a ^))- (5-22) 

Hence, the observables act by generating multiplication of g or 5, respectively, from the right. This 
is exactly the transformation under which the Haar measure in ( 5.19j ) is invariant. To render these 



64 



operators hermitian, we have to choose Q = 1 (or any other constant). Hence, the scalar product 
takes the simplest possible form 



(Fi.\ft) = J dgdbFS&fyft&b). 



(5.23) 



To illustrate some of the results of section 4.3, let us see how other observables act and which kind of 



ordering ambiguities appear. Consider the path j3 in figure |4j. It is obviously free of self-intersections, 
so we expect Vp a to be free of ordering ambiguities. There are two ways to derive the quantum 
operator. 



Exercise 5.3. Using the graphical rules from figure |3], show that 



(5.24) 



Hence, assuming that there are no "quantum corrections" in (4.61) for this operator, we have 



VI*>-|i»»(Sr*S )-i»»(7**^)>. 



(5.25) 



dg ' \ db 

This is obviously hermitian, too, because the Haar measure in invaraint under multiplication from the 



left as well. Another way to obtain the same operator is to use that (3 • ? = 7, so the relation ( 4.46 
gives 



From ( 5.21 ) it is easy to infer that 

so we can reorder the factors as we like and get 



(5.26) 



(5.27) 



V f3 a \F)=V J a \F)-U f3 a b V< ; b \F) 



±ihb a b Tr(bj' 



db ) 



(5.28) 



With b a b 67^ = 7 a b this becomes exactly the same as the operator given above. As another example, 
we can compute the observable V using ( |4.57| ) : 



V^i= -\u^ b a v, h -\v q b u^ b a 

lhi 6 



(5.29) 



Exercise 5.4- Evaluate the commutator, using s = j3 • q 1 • (3 and the identity "y a g^ a = —9 + 
2Tr(g)l for SL(2) matrices. The result is 



l[V, b ,Kb a ] = ^hTT{U e )TT(Up l U e U^ a ), 
and the resulting quantum operator becomes 

V ? °i \F) = \ imTr(6- 1 fl r-V3b7 6 ) Tr( 7 a & ^ 



(5.30) 



+ 



1; 



\hTi{g)Ti{b- l gb^ a )J : 



(5.31) 
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Here we get a quantum correction in form of an extra imaginary multiplication term of order h, and 
it is also obvious that there is an ordering ambiguity in the first part of this expression. If we bring 
the differential operator to the left, then it also acts on the prefactor, which depends on b. But the 
extra term is exactly what is needed to make the operator hermitian under the scalar product ( 5.23j ). 



which can be checked by partial integration. Hence, we see that the "quantum corrections" appearing 
in (4.61) are really necessary to obtain a full set of hermitian V operators. 



There are many other interesting questions about the physics described by this state space, and 
most of them are still to be explored. For example, one can now ask for the eigenstates of the V 
operators and their spectrum, look for complete sets of commuting observables other than the Id's 
to provide another basis for the state space, or find the uncertainty relations for the observables 
and construct semi-classical universes with minimum uncertainty. All typical questions arising in 
quantum mechanic can be translated to similar questions for our toy model of quantum gravity, and 
they will certainly also be interesting for quantum gravity in general. We will not go into more details 
here, as all this is far beyond the scope of this article. However, an article about three dimensional 
canonical gravity would not be complete without dicussing the standard example, so the last section 
will be a rather mathematical derivation of the physical Hilbert space for the torus. 

5.3 Torus quantization 

Quantization of three dimensional canonical gravity has mostly focussed on the torus, because, form 
a mathematical point of view, this is the simplest non-trivial manifold (for a really huge collection 



of references see [32|). In contrast to the cylinder manifold, there are no boundary conditions for the 
fields and we don't have to introduce the extra fields u and v, which makes the canonical formalism 
easier. On the other hand, it is the lack of the background structures associated with the boundary 
fields that makes it more difficult to extract physical information. We considered this in detail for 
the cylinder manifold above, so let us focus on a mathematically rigorous construction of the Hilbert 
space in this section. 

On the torus we have two primitive loops A and k, and they obey the relation A • k = k • A. 
Hence, the loop^group is Abelian and the wave function is Figxidn), which has support on pairs 
of commuting SL(2) elements, and must be invariant under inner automorphisms ( [4. 28 ). Before 



constructing the state space explicitly, let us remember what we know about observables measuring 



the "winding number" of the holonomies g\ and g K . As the torus has no boundary, exercise 5.1 tells 
us that there is no way for an observer sitting on the torus to distinguish between the holonomies 
gx andjL2zf?A) which are projected onto the same matrix gx £ SL(2) (see ( 1.21| ), where the elements 



I22 £ SL(2) have been introduced as the "loops" of SL(2); the maps g 1— > l2 Z g are the "shift" maps 



on SL(2) as introduced in section 1.1). Only the matrices gx and g K can be measured by passing a 



spinor once around the primitive loops. As a consequence, there are operators 

Vx\F{gx,gn)) = |^(l2flfA>flfc))> 

v K \ r(g x ,g K )) = I Hax, i2g K )}, (5.32) 

commuting with all physical observables: if there was any observable which did not commute with 
Vx , then it would be possible to measure the winding number of gx- The operators V are very much 
like the the permutation operators for identical particles in quantum mechanics. They also commute 
with all observables, as it is exactly this what "identical particle" means. In both cases the state 
space splits into "superselection sectors" . They are the eigenspaces of the V operators, which can 
always be diagonalized simultaneously with any other set of commuting observables. 

In multi-particle quantum mechanics, the square of any primitive permutation operator is the 
identity, which restricts its eigenvalues to be ±1, leading to bosonic and fermionic states. In our case 
the algebra of the V operators is free. Thus any eigenvalue is allowed except zero, because the Vs are 
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invertible, and for any pair of non-vanishing complex numbers q\, q K we get a superselection sector. 
Note that we do not even have a restriction for the eigenvalues to be imaginary or real, because a 
priori there is no reason for the V to be (anti)hermitian. 

Exercise 5.5. Show that all sectors are isomorphic: for any pair q\,q K £ C — {0}, one can define 
(quasiperiodic!) functions C\, C K : SL(2) — > C — {0} such that C\(i-2g) = q\C\{g) and similar for k. 
Then 

I F(g\,9 K )) >-> I Q\(ffA)C^ff K )^(ffA,fl/c)) (5.33) 

is a one-to-one map from the sector with eigenvalues (1, 1) onto the sector (q\, q K ). 

Hence, we only need to consider one special superselection sector, knowing that all others are isomor- 
phic (in contrast to quantum mechanics, where the bosonic anf fermionic sectors are not isomorphic). 
The simplest choice is of course the sector with eigenvalues 1 for both V operators. In this sector 
the wave function T depends on the SL(2) matrices g\ and g K only. 

To get a basis for the state space, we have to find the moduli space of the torus, but with respect 
to SL(2) instead of SL(2). This is not very difficult, the relevant properties of SL(2) have been studied 
already in section |1.2| . We found (exercise |1.9| ) that two matrices commute if and only if they lie on 
the same line through the origin in figure but they are allowed to lie in different parts. This line 
is unique if not both matrices are "null" elements of SL(2), i.e. either 1 or —1. In this case they 
do not transform under inner automorphisms, and we get four special equivalence classes of pairs of 
commuting matrices called 

C^(1) = {(*aM*1)}, (5-34) 

each consisting of a single pair only. In all other cases we can find a non- vanishing vector v a , an 
angle 9 and two signs Z\, z K = db such that the matrices are 

g\ = g(v a cos9,zx), g K = g{v a sin9,z K ). (5.35) 

Here we used the special coordinates g(v a , ±) on SL(2) introduced in ( |1.15| ). The vector has to be of 
length squared greater or equal to —1, so the range for v a and the angle 9 is restricted by 

v 2 > -l/cos 2 <9, v 2 > -l/sm 2 9, v a ^0, (5.36) 

where v 2 = v a v a . There is still a sign ambiguity in this parameterization as v a \— > —v a , 9 t—* 9 + ir 
does not change the holonomies. We can fix this partly by requiring that v a lies inside or on the 
upper lightcone if it is timelike or lightlike, then the only remaining ambiguity occurs if it is spacelike 
or lies on the boundary, where one of the signs is arbitrary. 

How do the equivalence classes look like? In section |l.2| we found that an inner automorphism 
acts on the vector v a like a proper Lorentz rotation, and this is of course still true with extra factors 
of cos 9 or sin# here. Given two vectors v a with the same length squared v 2 , then we can always find 
a proper Lorentz rotation mapping them onto each other. This is obviously true for spacelike vectors 
as the hyperboloids with v 2 > are connected. For timelike (or lightlike) vectors v a this follows 
because then by definition the vector always lies on the upper hyperboloid (or lightcone). Hence, 
an equivalence class of pairs of commuting non- null holonomies is determined by the length v 2 , the 
angle 9, and the two signs Z\, z K . We denote it by 

C Zx , ZK (v 2 ,9) = {(g(v a cos9,z x ),g(v a sm9,z K ))}. (5.37) 

Here v a runs over all non- vanishing vectors with the given length squared v 2 , lying inside or on the 
upper lightcone if v 2 < 0. Fixing the two signs z\,z K , we can draw a picture of the corresponding 
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Figure 5: The set of equivalence classes of pairs of commuting SL(2) elements. Each class is param- 
eterized by a length v 2 , an angle 6, and two signs Z\,z K . The picture shows the part C++, which 
forms a cylinder with a rather peculiar looking boundary. The complete moduli space is obtained 
by identifying 9 with 9 + tt in the spacelike region v 2 > and gluing the parts for different z\,z K 
together along the boundaries as indicated. 




region of the moduli space, which is shown in figure ||. It consists of the timelike region v 2 < 0, which 
is bounded by ( |5.36| ), a lightlike circle at v 2 = and the spacelike part v 2 > forming a cylinder 
that extends to infinity. In addition, there is an extra point C ZXjZk (1) in each region. 

The coordinates are not yet unique. In the spacelike region we find that the two classes for 9 and 
9 + 7r are actually the same. The reason is that we can rotate a spacelike vector into its negative 
(this is not possible for a timelike of lightlike vector). We have C Zx ^ K {v 2 ,9) = C ZX)Zk (v 2 ,9 + tt) for 
v 2 > 0. This leads to the identification shown in the picture. Finally, the parts for different values 
of z\, z K have to be glued together. 



Exercise 5.6. By using the relation ( 1.18 ), which tells us how group elements that lie on the bound- 
aries are identified, show that the cylinders in figure [| are glued together as indicated in the picture, 

i.e. the timelike region of C++ is glued to C ^,C_| etc. along the boundary, so that all four parts 

together form a torus, with four cylinders extending to infinity and four extra points. 

A picture of the compete moduli space of the torus is shown in figure |(| An interesting, but for our 
purpose not very important feature of this moduli space is its non-Hausdorff topology: 

Exercise 5. 7. The moduli space has a natural topology. It is a quotient space of a closed subset of 
SL(2) x SL(2) modulo an equivalence relation. Show that this topology is locally like R 2 at a generic 
point (otherwise we could not draw a picture), but non-Hausdorff at the lightlike circles and the 
singular points C ZX:Zk {1). In particular, every open neighbourhood of such a point contains the whole 
lightlike circle, and two opposite points on this circle cannot be separated by open neighbourhoods. 

We can now give a basis for the physical state space. There is one state for each point in the 
moduli space, and the corresponding wave functional is given by a distribution with support on 
the corresponding equivalence class of commuting SL(2) elements only. As we will not integrate or 
differentiate with respect to the holonomies, we do not have to introduce distributions etc. on the 
moduli space. If you want to consider the states as wave functions, just take J r (g\,9 K .) to be one on 
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Figure 6: The moduli space of the torus, consisting of the equivalence classes of commuting SL(2) 
matrices modulo inner automorphisms. It consists of four spacelike sectors in form of cylinders 
extending to infinity, where opposite points are identified, glue to a timelike sector, which is a torus 
with four holes. In between there are four lightlike circles and null points. Every point in this picture 
(except for the identifications in the spafcelike sectors) corresponds to exactly one basis element of 
the physical state space. 




the considered equivalence class and zero elsewhere. The basis elements of the state space are 



Z\,Z K ,V ,1 



(5.38) 



where the range of the parameters and the identifications are of course the same as for the equivalence 
classes. Let us introduce another set of coordinates on the moduli space, which will be more conve- 
nient later on. The states |1, z\, z K ) will be called null states, and the remaining will be divided into 
timelike, lightlike, and spacelike states, depending on v 2 . Of course, this means that the holonomies 
are timelike, lightlike or spacelike in the sense introduced in section [E^. On each of these sectors we 
will introduce separate coordinates and "standard representatives" for the matrices. 

Let us start with the timelike states. By a proper Lorentz rotation, we can always achieve that 
the vector v a is vertical in figure |2|, i.e. it lies on the 70 ax is an d the holonomies are elements of the 
S0(2) subgroup. Then we can find two real numbers a, b such that 



9\ 



cos a sin a 
— sin a cos a 



9k 



cos b sin b 
— sin b cos b 



(5.39) 



The parameters a, b are obviously periodic with period 2tt, but apart from this they are unique, 
because every timelike hyperboloid intersects the S0(2) subgroup exactly once. However, a and b 
cannot be both multiples of tt, as then we are back in the null sector. The states will be denoted by 



T,a,b), (a, b) ^ (mir, nii), m,n^7j. 



(5.40) 



The new parameters a, b are very different from the old v 2 ,9 coordinates, as we now have one set 
of coordinates for the whole timelike sector, which was glued together in a complicated way before. 
Now we can easily see that it is a torus with four holes, but the boundaries are shrunk to points. 

For the spacelike classes the result is similar. We can always rotate a spacelike vector such that it 
lies on the horizontal axis in figure 0, i.e. the 71-axis. In contrast to the timelike sector, the spacelike 
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sectors as shown in figure |6| are not connected, so we have to keep the two signs as parameters. The 
commuting matrices representing a spacelike class are 

/cosh a sinh a \ _ /cosh b sinh 6 \ , , 

9x — z x\ • i, „ „ i „ ) 9k — Zk \ ■ , , „ , , , (, 5 - 41 J 



\ sinh a cosh a / K R \ sinh 6 cosh b 

and the corresponding states are called 

\S,z x ,z K ,a,b), (a, b) ^(0,0) (5.42) 

Here we have to exclude a = 6 = 0, which yields a null class. Of course, this corresponds to 
the boundary of the spacelike region, which is also shrunk to a point. In addition, this time the 
parameters are not unique, as a given spacelike hyperboloid intersects the 71 axis always twice and 
we can rotate a spacelike vector into its negative by a continuous transformation. This leads to the 
identification 

\S,z\,z K ,a,b) = \S,z\,z K ,-a,-b), (5.43) 

Finally, every lightlike vector can be rotated into the special vector v a = (^,^,0), because every 
vector on the upper lightcone (by definition v a lies always there) can be transformed into every 
other. We then keep the old coordinate and get the representatives 

/l cos#\ /l sin#\ , ., 

»* = *Ho 1 )' 9x = Zk {o 1 )' (5 - 44) 

and the state functionals are denoted by 

\C,z x ,z K ,6). (5.45) 

To summarize, we now have a complete basis of the state space of the torus, and a general element 
of this state space can be expanded as 

\F) = ^1(^,^)1 l,Z\,z K ) + j> dadbJ : T {a,b) \ T,a,b) 
+ Y dadbj r s (z\,z K ,a,b)\S,zx,z K ,a,b) 

z\,z K J 

+ J2 <fdOFc(zx,z K ,0)\£,z x ,z K ,e), (5.46) 

where the wave function Ts is assumed to be symmetric Fs{z\, z K , a, b) = J^sizx, z K , —a, —b), and Tr 
and Tc are periodic in a, b and 6, respectively. 

Exercise 5.8. The v 2 ,6 coordinates above together with the a,b coordinates on the timelike and 
spacelike sectors provide a full atlas of the moduli space in figure || (ignoring the singular points 
and the identifications in the spacelike sector to make it a proper manifold). Find the transition 
functions. 

Now we have to find out how the observables act on the basis of the physical state space. As there is 
no boundary, we only have the trace observables and some exceptional operators representing gauge 
independent properties of the holonomies which are not contained in the trace. Because the loop 
group of the torus is Abelian, every loop can be written as g = X m • K n for some m,n E Z. The 
corresponding trace observables are 

U(m,n) = ±Tr(g e ) = \ Tr(l/,(0, 1)), 

V(m,n)=Tr(g e f e )= [ ds Tr(U Q (0, s) U g (s, 1)). (5.47) 
Jo 
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The IA operators are obviously pure multiplication operators and they commute with each other. In 
fact, they form an algebra, as every product of them is again a linear combination. Explicitly, we 
have 

2U(m, n)U(p,q) = IA (n + p, m + q) + IA (n — p, m — q) , (5.48) 
and, as well as the V's, they are not independent. The most important relations are 

U(m, n) = U{-m, -re), U(0, 0) = 1, 

V(m,n) = V(-m,-n), V(0,0)=0. (5.49) 
All these relations are derived from the following useful identities for SL(2) matrices: 

g + g- 1 = Tr(gr)l, g - g 1 = Ti(g>y a ) la , 

Tr(flf) Tr(flr') = Tr(gg') + Trfa^g'). (5.50) 

It is now straightforward to find out how the IA operators act on the physical states. They multiply 
them by the trace of the corresponding matrix g\ m g K n , and since the trace is gauge-invariant, we 
may insert the "standard representatives" given above for the holonomies, i.e ( |5.39 ) for the timelike 
states etc. The result is 

U(m,ri)\ l,z\,z K ) =z x m z K n \ l,z x ,z K ), 
U(m, n)\ C, z\, z K , 9) = z x m z K n \ C, z\, z K , 9), 
U(m, re) | S, z\,z K , a, b) = z\ m z K n cosh(ma + nb)\S,z\,z K ,a,b), 

U(m,n)\T , a, 6) = cos (ma + nb)\ T, a, b). (5.51) 

The basis states are obviously eigenstates of the U operators, but some eigenspaces are still degener- 
ate. Only the spacelike eigenspaces are one dimensional, because by ( 5.43| ) a, b and —a, — b represent 
the same state. In the timelike sector, the eigenspaces are two dimensional consisting of the states 
|T, a, b) and \T, —a, —b), and the null and lightlike sectors are completely degenerate. 

Exercise 5.9. Show that the following observables together with the IA operators form a complete set 
of commuting operators such that all eigenspaces are non-degenerate: 

1 for g\,g K = ±1, 
otherwise, 

sign Tr(fifA7o) if 9\ is lightlike or timelike 
otherwise , 

sin O Tr( 5A7a ) = cos TV(^ 7a ). (5.52) 



£ 



The first checks whether the state is a null state, the second, together with a similar observable for 
k, tells us whether the matrices lie in the upper or lower lightcone if they are timelike or lightlike. 
In the third equation the angle & is defined modulo n only, but together with the Z observables we 
can fix its value on the lightlike states by requiring 

0\£,z x ,z K ,6) = e\jC,z x ,z K ,6). (5.53) 

As we want all these observables to be hermitian, the scalar product on the physical state space has 
to be such that the basis states are orthogonal. Of course, we can then normalize the basis to be 
orthonormal and the scalar product reads 

(l,z x ,z K | l,zx,z K ) = 1, 
(£, z\, z K , 9 | £, z x , z K , 0') = 5(9 - 9'), 
(S, zx, z K ,a, b | S, z\, z K , c, d) = 5(a - c) 5(b - d) + 5(a + c) 5(b + d), 

(T, a,b\T, c, d) = S(a - c) S(b - d), (5.54) 
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and vanishes if the signs z\, z K do not coincide. Note that this is actually not the final definition of 
the scalar product, it just defines the product as a function of the basis elements. As the basis was 
only required to consist of eigenstates of the U operators, we can still multiply all basis elements by 
arbitrary complex factors. If we want the V observables to be hermitian as well, we now have to 
choose the basis states themselves appropriate. 

The procedure to derive the product is slightly different from that used for the cylinder. What 
we are going to do can be explained most easily using our standard example again. Assume we have 
a particle in one dimension and let IA be the position operator. Then what we defined up to now 
is a basis consisting of eigenvectors U\x) = x\x), but we actually did not fixed the normalization 
of the basis vectors \x). Nevertheless we can define a scalar product depending on the basis by 
(x\y) = S(x,y): for each choice of basis states we get a different scalar product. Now there is another 
operator V with [V,W] = ih. The task is to fix the basis \x) such that V becomes hermitian. From 
the commutator we learn that 

[V,U]\x) =ih\x) V\x) =ih-^\x) -hg'(x)\x), (5.55) 

where g is an unknown complex function. We can change the basis \x) *— > exp(ig(x)) \x), and in the 
new basis V simply acts as an differential operator, and it also becomes hermitian with respect to 
the scalar product derived from the new basis. In principle, it would be sufficient to render g real, 
but then we can still multiply the basis states by a phase factor, which does not change the scalar 
product any more, but simplifies the action of V on the basis. As a result, we get a basis and a scalar 
product for the state space, without having to know their explicit representations in terms of wave 
functions. 

Hence, all we have to do is to compute the commutator algebra of the IA and V observables, then 
we know their action on the basis states up to some function like g. Provided that it is possible 



to make all V observables hermitian, there must be a choice for the g function such that ( 5.54 ) is 
the correct product. Computing the commutator of IA and V is straightforward. We rewrite the V 
operators as 

V(m,n)= f 1 ds TT(U e (0,s)-f a U e (s,l)) ' i (s)e i a (e( S )), (5.56) 
Jo 

where g is any loop homotopic to X m • n n . Choosing another loop ? ~ A p • n q such that it intersects 



g at finitely many points only, we can use the commutator ( 4.41 ) to get 

-l 



[V(m,n),U(p,q)] = / ds Tr(U e (0,s)j a U g (s,l)) g' 1 (s)[ ei a (g(s)) , Tr(J7 ? (0, 1))] 



o 



■\m I dsdte^/ts)^) J(£(s),^(t)) x 

Tr(t/ e (0, S ) 7a C/ e ( S ,l)) Tr(£/,(0,t) 7 a C/ ? (M)). (5.57) 



o 



Using the second formula in ( |5.50| ) the product of the two traces becomes 

Tr(U e (0,s)U,(t,l)U,(0,t)U e (s,l)) 

- Tr(U g (0,s)U,(t,0)U,(l,t)U g (s,l)). (5.58) 

The delta function picks up a term only at intersection points of the two curves. Hence, the first 
of the two traces is the trace of a transport operator going along g to an intersection point, then 
around ^ to the same point again, and finally further along g, the second term is the same but the 
direction of ? is reversed. The values of these transport operators are in fact independent of the 
special intersection point, as on the torus they depend on the two winding numbers of the loop only. 
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In particular, they take the same values for every intersection point. The integral then simply counts 
the intersections of the two loops, as already seen in exercise 4.5. Choosing the orientations of the 
loops such that the intersection number of A with k is +1, the intersection number of X m • K n with 
X p • K q is mq — np. We finally end up with the commutator 

[V(m, n),U(p, q)] = ^ih(np — mq)(U(m + p,n + q) — Uim — p,n — q)). (5.59) 



Exercise 5.10. Show that a similar relation holds for the commutator of two V operators: 

[V(m, n), V(p, q)] = \\h{np — mq)(V(m + p,n + q) — V(m — p,n — q)), (5.60) 

either by computing it straightforward using fl4.41| ), or by showing that it is linear in V again and 
using the Jacobi identities. 

We can now compute the action of the V operators on the basis states. For the timelike states, we 
have 

[V(m, n),U(p, q)]\ T, a, b) = ^ih(np — mq)(U(m + p,n + q) — Uim — p,n — q))\ T, a, b) 

= —ih(np — mq) sin(ma + nb) sin(pa + qb) \ T, a, 6), (5.61) 

where we used the formula cos(x + y) — cos(a; — y) = — 2 sin x sin y. We can rewrite the eigenvalue 
appearing in the last expression as 

ihsm(ma + nb)(n— — m—) cos(pa + qb), (5.62) 
so that the action of the V operator is fixed to be 

( d d \ 

V(m, n) I T, a, b) = —ih sin(ma + nb) I n— — m— I T, a, b) , (5.63) 

V oa do ' 

up to an additional multiplicative term. However, one can easily check that the commutator relations 
between the V operators are satisfied without "quantum corrections" , and in this form they are also 
hermitian with respect to the scalar product ( 5.54| ). Using our argument from above, any possible 
multiplicative term can now be absorbed into a redefinition of the basis, so that the last equation 
gives the correct action of the V operators on timelike states. The computation for the spacelike 
sector is similar and gives 

V(m,n)\S, z\, z K ,a,b) = -ihz\ m z K n sinh(ma + nb) ( n- m— ) I S, z\, z K , a, b). (5.64) 

\ oa ob > 

Another important fact to note here is that the vector fields represented by the V operators on the 
timelike and spacelike sectors of the moduli space all vanish at the excluded points (0, 0) for the 
spacelike sector and (mn, nir) for the timelike sector. Hence, acting with any observable on spacelike 
and timelike states will never produce a state outside these sectors. We get another split into 
superselection sectors. The lightlike sectors should then of course also be mapped onto themselves 
by the V observables. To see how they act there, we have to use some formulas: 

Exercise 5.11. Compute the commutator of the V observables with the matrices g\ and g K . It is 
essential that they commute (as matrices), otherwise the result would depend on the special loop 
chosen in (5.56): 

[Vim, n),g x ] = \inn(g\ +m g n K - g^V) 

[V(m,n),g K ] = -\\n m (g^ g l + n - g^ m gl~ n )- (5-65) 
Show also that for lightlike matrices we have 

M9x m 9n n la) = mTr(g xla ) + nTr(g Kla ). (5.66) 
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We can now derive the commutator of V(m,n) with the angular variable introduced implicitly 
by ( |5.52 ). We just compute the commutator with the whole equation, which of course has to vanish. 



This gives 

-[V(m, n), 6>]( cos 9 Tr(g xla ) + sin© Tr(g K j a )) 

= sin (9 [V(m,n), Tv(g x ja)} - cos <9[V(m, n), Tr(g K j a )] 

= ih (nsin<9 + mcos<9) (mTr(g\~f a ) + n Tr((/ K 7fe)), (5.67) 

where we used the formulas given in the exercise and the fact the the matrices are lightlike. Now ( |5.52| ) 
implies that there is a vector v a such that Tr(<7A7a) = 2w a cos@ and Tr(g K 7 a ) = 2v a sin@ (which of 
course is the vector introduced in ( |5.35| )). Inserting this into the last equation gives 

[V(m, n), 0] = ih (nsinO + mcos O) 2 (5.68) 

From this we could infer that V(m, n) has to acts on lightlike states as 

d 

V(m,n) | £, z\, z K , 9) = -ih(nsin9 + mcos9) 2 — \ C, z\, z K , 9), (5.69) 

<j9 

but this not hermitian. Here we have to add an imaginary multiplicative term, or even simpler we 
have to "symmetrize" the operator: 

d 

V(m, n) | C, Z\, z K , 6) = —ih (nsinO + mcos 0)— (nsm.9 + mcos 9) \ C, z\, z K ,9), (5.70) 

o9 



which is obviously hermitian with respect to the product (p. 54 ). This completes the construction of 



the physical Hilbert space for the torus. All observables now have well defined action on the basis 
states and we saw that it is possible to render all these operators hermitian with respect to a unique 
scalar product. The state space splits into 13 superselection sectors, the four spacelike, lightlike and 
null sectors and the single timelike sector (in addition to the split into superselection sectors with 
respect to the V operators above, leading to sectors which cannot be distinguished by an observer 
sitting on the torus). A general state functional is given by expanding it as shown in ( 5.46| ), and 



restricting the wave functions of the non-compact spacelike sectors to be square integrable, i.e. 

J da db r s *r s < oo, (5.71) 

we finally get a well defined Hilbert space. Using that the scalar product is unique up to rescaling, 
we can answer some questions arising in a naive description of the state space of the torus, where one 
assumes that the wave function can be written as function of the traces of holonomies. First of all, 
we immediately see that in this way we cannot cover the whole state space, because the traces alone 
cannot distinguish between lightlike and null states. So already then we loose some states. Some 
approaches go even further trying to construct a Fock-like space based on the IA operators as creation 
operators. This leads to the so called loop states or loop representation, providing a discrete basis for 
the state space. There have been some discussions about the existence of such a basis [33, 34, 35, j36|, 
so let us close this section with a brief look at these states. 

We first have to find a suitable definition for loop states in our notation. Usually they are defined 
as wave functions J- = Tr(g g ), but as we do not know how our basis above looks like when expressed 
in terms of wave functions, we have to construct them using some basic properties. In fact, the 
loop states can be generated as follows. There is a "vacuum" 1 0, 0) , represented by a constant wave 
function. In our formalism, this means that acting on it with the differential operators V(m,n) gives 
zero. Then the loop states are 

\m,n) =U(m,n)\0,0). (5.72) 
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They obviously have a nice generalization to higher genus manifolds, because we get a state for each 
homotopy class of loops on J\f. By using commutation relations, one can derive the action of the U 
and V operators on these states, and they are quite simple. However, in general the loop states are 
not normalizable, and they actually provide a very inconvenient basis of a subset of the state space. 
If V(m, n) |0, 0) = for all m, n, then all the functions T in the expansion ( |5.46| ) must be constants, 
but not necessarily equal. If we want the vacuum to be normalizable, the spacelike parts must vanish, 
because otherwise we would get an infinite contribution to the norm of the state. So loop states can 
only be timelike, spacelike, or null, if we want them to be normalizable. On the other hand, in the 
null and lightlike sectors we cannot build up a Fock structure, because acting with the IA operators 
on the vacuum just reproduces it. Hence, the only useful loop states are timelike. We can compute 
them explicitly and get 

\m,n) = <j> dadb cos(ma + nb) | T, a, b). (5.73) 

Note that in a similar construction for the spacelike sector a cosh would appear, so that the norms 
would be even more divergent than the norm of the vacuum. For the timelike states, however, the 
norm is finite, and they are in fact orthonormal: 

(m,n\p,q) = 2tt 2 {5 mi p5 n ,q + 5 m - p 5 n - q ), (5.74) 

the symmetrization appearing here because the U operators have the symmetries (|5.49| ). But still 
they cover only half of the timelike sector, the remaining states with cos replaced by sin cannot be 
written as loop states. Hence, we have to conclude that loop states do not provide a suitable basis 
for the physical state space. They only exist when restricted to the timelike sector, but this means 
that they are not represented by wave functions of the form T = Tr(g\ n g Ki m ), which doesn't vanish 
on the spacelike sector. Even the existence of the timelike loop states is rather accidental and a 
special feature of the torus. It is essential that there is a superselection sector of the state space 
corresponding to a finite region of the moduli space. For the cylinder there are no such sectors and 
the wave functions J-(g,b) = Tr(g n ) are obviously not normalizable. 

Coming back once again to simple point particle quantum mechanics, it is quite obvious to see 
what goes wrong. If you want to find a Fock-like basis for the state space for a particle in one 
dimension, you can, e.g., take the harmonic oscillator eigenfunctions to be your basis states, and 
this of course works perfectly, because you have the "correct" relations between the creation and 
annihilation operators, which are the hermitian conjugate of each other. For the loop states this 
is totally different, as the creation operators themselves are hermitian. The quantum mechanical 
analogy would be to define the "vacuum" by p\0) = and to create the basis states by x\n) = \n + 1). 
This obviously leads to the wave functions ^ n (x) = i", which are not normalizable and do not 
provide any reasonable basis of the state space. The only exception occurs if the range of x is finite, 
and this corresponds to our timelike loop states. 



Appendix 

In section ^ we did not give the details of how to compute the equations of motion and the Poisson 
brackets for the boundary fields u and v. The technical details of these computations are rather 
cumbersome, though the results can be summarized in the simple equations ( |3.28| ) and the brack- 
ets ( 3.32^p734 ) . In principle only one of the two calculations is necessary, as once we have the Poisson 



brackets we could obtain the equations of motions by computing the brackets with the Hamiltonian. 
However, let us nevertheless do both independently and check whether the results are correct. 

To compute the equations of motion for u and v, we split the Lagrangian into an interior and 
a boundary term. We take the interior term to be everything which does not depend on u or v, 
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i.e. (|3.9|) , plus the cj^ and 7^ part of ( |3,24 ). With Stokes' theorem we can write this as a single 
integral over the interior of J\f: 



Lmt = Jd 2 x e tJ Tr(u>;7j - L\u>t Jj - drh u>j + Jt^j)- 



(A.l) 



The boundary term is the rest of (|3.24|) plus the kinetic term (|3. 27| ) . By using ( 3.23 ) we can rewrite 
it as 



(A.2) 



i 

- j> d(f) Tr(w _1 'U7^ - cjt70 - 7^ 



where 7^ and cj^ are periodic functions of u and v: 

= u^d^u, 7^ = u^d^v u. 



(A.3) 



Only the multiplier fields appear in both parts and we can derive the equations of motion for the 
remaining fields independently. Varying u>i and 7^ in L; nt immediately gives ( |3.25| ). Now consider 
a variation of v in Lbnd- As v * s completely (and uniquely) determined by v(<fio) and 7^ via the 
differential equation ( |A.3j ), we can equally well vary 7</> and v(4>q) independently, which gives us the 
equations of motion 



U U = Uf, 



Ug v,£ = 0. 



(A.4) 



So we already found the first part of ( [3.28 ). The variation of u is slightly more complicated, because 
we cannot vary u without varying v: the relation ( 3.22[ ) has to remain invariant. However, as before 
we can vary uj^ and u at some special point independently, keeping 7^ and v((/)q) fixed. To find the 
variation of ^bncb we can use 



5(u = u 1 dt{5uu 1 )u, 



(A.5) 



and a similar formula for U£, which follows from the identity du~ l = —u duu for any matrix u 
and any differential operator d. Inserting this and integrating time derivatives by parts if they act 
on variations, we find 

^bnd = E ( " MuF^ugv^o)) + <f d<p Tr(Suu-%v + y t 5^)) , (A.6) 



where we also dropped terms with iig vanishing by (A.4). Integrating the d$ derivative by parts, we 
can use 



-In 



and we have to take a boundary term into account: 



uS(u 1 d ( j ) u)u 1 = u 1 5us ( f ) u, 



(A.7) 



1 

+ f dc/> Tr(to (7 t - 



Making use of (3.22) again, we find that the first term can be converted into 



^bnd = E (T^uu-^o + 2n)v e ) - I d<f> Tr(5. 
f J<f>o 



UU(U 1 vu-j t 



(Ai 



(A.9) 
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Thus we may consider du^ and 5u((f)Q + 2ir) as independent and get the equations 

u~ 1 i)u = ~f t , V£ = 0, (A. 10) 

which is the second part of ( |3,28| ), 

The Poisson brackets of the boundary fields are also slightly complicated to obtain, mainly because 
they are group and algebra valued and quasiperiodic. There are various methods to derive the 
brackets. For example, one could treat the conditions ( |3.22 ), defining u and v to be quasiperiodic, 



as second class constraints, as well as the requirement that u £ SL(2) and v £ sl(2). Then u and 
v would become matrix fields on a real axis and one could derive the Dirac brackets in the usual 
way 0]. For group valued fields, there are methods to avoid this detour via Dirac brackets |37|, 
but still we have to deal with the quasiperiodic character of the fields. However, after all the only 
requirement to be imposed in the brackets is that they reproduce the correct equation of motion, 
which have to coincide with the Euler Lagrange equations just derived. This leads to a nice trick 
to check whether some "suggested" brackets are correct, and we will use this trick to show that the 
brackets for the boundary fields given below are the right ones. This is much easier than calculating 
them by extracting the canonically conjugate momentum of u{4>) and solving second class constraints. 

To use this trick, we define a very special bracket, namely that of a phase space function F 
with the kinetic part of the Lagrangian, which is always assumed to be of first order, so that the 
kinetic part simply consists of the terms containing a time derivative. The Lagrangian then splits 
into L = L k [ n — H, where H is the Hamiltonian. In general, the Poisson bracket can be written as 

W«> = E5£ {*.«■>*-. ^ 

where q r is any set of coordinates on the phase space. The bracket with L k \ n is defined by replacing 

<9L kin dL kin d dL kin 



dq r dq r dt dq r 



(A.12) 



It just means that whenever we compute a Poisson bracket with a velocity, we first integrate the 
kinetic Lagrangian by parts. This is reasonable because L k i n is anyway defined up to a total time 
derivative only. The result of such a bracket is again an expression which is of first order in time 
derivatives. 

How does this help us to find brackets? Suppose we have a suggestion for the Poisson brackets, 
then is is quite obvious that 

{F,L kin }-{F,H} (A. 13) 

is proportional to the equations of motion. Hence, if for every phase space function F we have 

{F,L kin } = F, (A.14) 

then the brackets do provide the correct equations of motion. In our case, the Lagrangian has a 
rather simple structure, and we can even proof that under the condition ( A.14| ) the brackets coincide 



exactly with the correct Poisson brackets. If we summarize the degrees of freedom of it by a set of 
coordinates q a and those of v by p a , then the kinetic Lagrangian has the form L km = p a M aa q a , i.e. 
it is linear in both q a and p a , and the matrix M aa depends on the q's only. We do not assume that 
M aa is quadratic or even invertible. 



Now suppose we have the Poisson brackets such that the q a commute and ( A.14 ) holds. Take 
F = q b : 

Qb = {qb,PaM aa q a } = {qb,Pa}M aa q a - (A.15) 
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Thus M aa is left invertible and {qb,Pa}M aa = 5 a b- Then choose F = pp: 

Pf3 = {Pt3,P a M aa q a } 

= {Pf3,Pa}M aa q a + p a {p{3,qb}d b M aa q a 

- PadbMaaijblp/3, q a } - p a M aa {pp, q a } (A.16) 

where db denotes the derivative with respect to the variable qb- The first three terms must cancel and 
from the last term we infer that M aa is also right invertible, the bracket {q a ,Pa} being the inverse. 
Some more algebra then shows that cancelation of the first three terms implies 

{Pa,P/3}M aa Mj3b = ppdaMpb - Pa dbM aa . (A.17) 

Using this and the previous results we find 

{PaMa^ppMpb} = 0, {p a M aa , q b } = 5 ab . (A.18) 

Now p a M aa is exactly the "canonically conjugate" momentum of q a , so the brackets are indeed the 
correct ones, which one can equivalently obtain directly from Lkin- 

Obviously, deriving the brackets from in the usual way seems to be much simpler. However, 
observe that ( A. 14 ) does not require a suitable basis of the phase space, like q a ,Pa above, to check 
whether the brackets are correct or not. In fact, it is not easy to find a simple basis for quasiperiodic 
fields. However, there is a simple (over-complete) set of phase space functions to test ( A. 14 ), namely 
the values u((j)),v ((/>), (j) £ M. A disadvantage of this method is that it does not help us to derive the 
brackets. The ansatz is, however, motivated by using the general methods for group valued canonical 
fields Q. 

Because of the derivative acting on v in the Lagrangian, we expect the brackets to be non-local, 
and in fact the full brackets of u with v and of v with itself are rather awkward. They simplify 
considerably (but are still not as simple as those for tjj and -ft) if we fix 0o appearing in (ET27D , and 
consider v((f>o) and 7</>(0) as independent variables instead of v((j>), as we already did above. The 
kinetic Lagrangian expressed in terms of these variables is 

L kin = E (Mu^uev^o)) - <f d<f> Ttiu^u^)). (A.19) 

£ J<t>Q 

Let us start with the bracket of u{(p) with v{4>q). We make the ansatz 

Tr(a^(0 o ))} = A(a, - </> ) u(<f>), (A.20) 

where A(a, (j)) is an s((2) valued function depending linearly on a, and possibly also on u. Note that 
00 was arbitrary in ( |A. 19| ) , so this is the complete bracket of u and v. We can now compute the 
bracket of v(4>q) with the periodicity parameter ug and get 

{u e , Tr(at>(<A)))} = {«(0 + 2tx)u'\4>), Tx{av{^))} 

= A(a, <p - 0o + 2tt) ui - U£ A(a, <fi - <f>o). (A. 21) 

Now the right hand side must be independent of eft. Hence, there must be a matrix Ao(a) G sl(2) 
such that 

A(a, 4> + 2tt) ui = u e A(a, <f>) + u e A {a). (A.22) 

As the last bracket is then also inpendent of 0o, we infer that the bracket of up with 7^ = u~ l d^vu 
vanishes. This leads to the following bracket of U£ with L^ Q , which has to reproduce the velocily of 

Uf. 

{u£, L kin } = {ui, Tr(tt^ 1 w^t;(0o))} = uiA (u£Ui) = ii£. (A.23) 
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Thus we must have Ao(a) = a - To get the correct bracket of u with the kinetic Lagrangian, it turns 
out that the only additional requirement is 



A(a, 4>) = for < <f> < 2vr. (A.24) 



Together with the quasiperiodicity ( A, 22] ) this defines A uniquely as a stepfunction in eft, up to the 



indeterminate values at the points (f> = 2irz. One can easily check (by calculating the hights of the 
steps recursively) that the derivative of the stepfunction is 

d^A{a, 0) = J2( u ^ a 5 ^ ~ 2nz )- ( A - 25 ) 

Differentiating ( A.2CQ with respect to <j> or 4>q and using the quasiperiodicity relation for u gives 

{u(4>), Tviau^d^v u(4> ))} = - u(4>)a5{<f> - <j> Q ), 
{u-%u(<l>), Tr(a^(0o))} = ^ 1 (0o)au(0o)?(0-^o), (A.26) 

where 5 ((f)) = Yl^i^ ~ 2vrz) is the periodic delta function on the circle. This is already the bracket 
given in ( |3.33| ), as u~ 1 d ( pvu = 7^, and u d^u = u^. Using these formulas we can compute the 
bracket of tt(</>) with L^ n . As we can choose </>o in ( A.19| ) arbitrarily, we take 4>o < <f> < 0o + 27r, i.e. 



lies inside the range of the integral and the bracket of u(<j)) with v(J>q) vanishes by (A.24): 

{u{0),L kin } = {u(0),- I d<l>' Tr^tt^)} 

d(p'u(4>) u- 1 ^') u{<j)') 8{<j) - ^') = u(<f>). (A.27) 

Finally, we have to define the brackets of v with itself. They are fixed by requiring that any bracket 
of v with itself is again linear in v, and of course the Jacobi identities must hold. Taking the bracket 
of ( A.2C| ) with Tr(bv((j)o)) (with the same angular coordinate again), and antisymmetrizing in a,b, 



leads to the bracket 

{Tr(av(ct> )), Tr(b^o))} = - Tr([a, 6]«(fo)). (A.28) 



Note that this is valid only if the two w's are taken at the same point. From the first line in (A.26) 
we infer that Tr(a^) acts on u by multiplying it from the right with a, whereas Tr(av) in ( A.2C ) 
acts by multiplication from the left with some other matrix. Hence, these two actions commute and 
using the Jacobi identities gives 

{ Tr(a u^d^v «(</»)), Tr(fw(<£ ))} = 0. (A.29) 



This is the first line of ( |3.33 ) and can be transformed into a differential equation for the commutator 



of v(4>) with v(<f>o), which is obviously linear in v again, because we have the initial condition ( A.2S| ). 



Note that, in constrast to the bracket ( A.20Q , that between the t>'s is an integrated stepfunction, and 



hence it is continuous in the angular coordinate, so that its value at the special point ( A.28| ) is well 
defined. We do not need the full bracket explicitly to check whether they are correct. If we want to 
compute the bracket of v((p) with the kinetic Lagrangian, we can take <j) = 4>q, because 4>q is arbitrary 



in ( A. 19] ). Then v((f>o) commutes with the integral, as it commutes with 7^ and by ( A.2CQ it also 



commutes with u((j>) within the range of the integral (f)Q < <p < (fro + 2ir (there is no distributional 
contribution at the boundary, just a finite one which does not enter into the value of the integral). 
So we only have to consider the bracket with the first term. A formula similar to ( |A.5[ ) can be used 
to simplify the calculation: 

{X, u^iii} = ui l d t {{X, ui>}ui l ) U£, (A.30) 
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where dt acts on ui only. Using this we find with a partial integration of the dt : 
{Tr(av(</> )), Triu^utvifo))} 

= ^(uiau^dtiuiv^o)^ 1 ) - [a,uj l ui\v(^ )) = Tr(av(4> )). (A.31) 



Hence, with the brackets introduced, ( A.14| ) holds and they are the correct Poisson brackets. 
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